ARTIFICIAL MATRIX ATTACHMENT REGION FOR INCREASING
EXPRESSION OF GENES INTRODUCED IN PLANT CELLS

The present invention relates to plant molecular
biology, and in particular to technology for enhancing
the expression of genes introduced in transformed plant
cells.

Through the use of recombinant DNA technology and
genetic engineering, it has become possible to introduce
desired DNA sequences into plant cells to allow for the
expression of proteins of interest. Plants with
genetically engineered traits, such as, for example,
insect resistance, disease resistance, drought
resistance, herbicide resistance, or metabolic
alterations that increase or modify production of useful
plant products, offer great promise of improving
agriculture.

Obtaining desired levels of expression of DNA
introduced into plant cells remains a challenge. One
problem, referred to as "position effect" variation, is
the variation in expression of the same gene in
independent transformants. The use of naturally
occurring DNA sequences called matrix attachment regions
or scaffold attachment regions to combat this problem was
proposed in U.S. Patent 5,773,689 and in WO 94/24293.

The present invention provides a novel synthetic DNA 
molecule comprising bp 11 to 309 of SEQ ID NO: 1 that is 
useful as a matrix attachment region to increase 
expression of genes introduced in transformed plants. 

In another of its aspects, the invention provides a
DNA construct comprising, in the 5' to 3' direction: a
transcription initiation region functional in plant
cells, a structural gene operatively associated with the
transcription initiation region, a 3' untranslated
region, and a matrix attachment region comprised of bp 11
to 309 of SEQ ID NO: 1 positioned either 5' to said
transcription initiation region or 3' to said structural
gene. In a preferred embodiment, a first matrix
attachment region comprised of bp 11 to 309 of SEQ ID
NO: 1 is positioned 5' to said transcription initiation
region and a second matrix attachment region comprised of
bp 11 to 309 of SEQ ID NO: 1 is positioned 3' to said 3'
untranslated region.

In a particularly preferred embodiment, the matrix
attachment region of the invention comprises two or more
tandem copies of bp 11 to 309 of SEQ ID NO: 1.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing the strategy for
assembling the artificial MAR of SEQ ID NO: 1.

FIG. 2 is a schematic representation of the rice
transformation construct ArGOS2Af, which contains the MAR
dimer.

FIG. 3 is a graph comparing relative GUS activity
for multiple rice transformation events using non-MAR
containing construct GOS2 and construct ArGOS2Af, which
contains an artificial MAR dimer.

FIG. 4 is a graph showing the effect of the
artificial MAR on ranges of expression of the GUS
reporter gene in transgenic rice plants.

FIG. 5 is a schematic representation of the
Arabidopsis transformation constructs ArAct2Af and
AfAct2Af. ArAct2Af contains copies of the MAR dimer
in opposite orientations flanking the reporter gene.
AfAct2Af contains copies of the MAR dimer in the same
orientation flanking the reporter gene.

FIG. 6 is a graph comparing relative GUS activity
for multiple Arabidopsis transformation events using
non-MAR containing construct Act2 and constructs ArAct2Af and
AfAct2Af, which contain an artificial MAR dimer.

FIG. 7 is a graph showing the effect of the
artificial MAR on the range of expression of the GUS
reporter gene in transgenic Arabidopsis plants.

DESCRIPTION OF THE SEQUENCES 



SEQ ID NO: 1 describes the artificial MAR of the
invention.

SEQ ID NOS: 2 to 4 describe ARBP sites.

SEQ ID NO: 5 describes an ATF site.

SEQ ID NO: 6 describes a BEAF-32 site.

SEQ ID NOS: 7 to 9 describe topoisomerase II sites.

SEQ ID NO: 10 describes an unwinding sequence.

SEQ ID NOS: 11 to 17 describe SATB1 sites.

SEQ ID NO: 18 describes exemplary bending DNA.

SEQ ID NO: 19 describes an exemplary A/T tract.

SEQ ID NO: 20 describes synthetic MAR-A.

SEQ ID NO: 21 describes synthetic MAR-B.

SEQ ID NO: 22 describes synthetic MAR-C.

SEQ ID NO: 23 describes synthetic MAR-D.

SEQ ID NO: 24 describes synthetic MAR-E.

SEQ ID NO: 25 describes synthetic MAR-F.

SEQ ID NO: 26 describes the 3' MAR dimer in
pArGOS2Af-hpt and ArAct2Af-bin.

SEQ ID NO: 27 describes rice transformation vector
pGOS2-hpt.

SEQ ID NO: 28 describes rice transformation vector
pArGOS2Af-hpt.

SEQ ID NO: 29 describes the 5' MAR dimer in
pArGOS2Af-hpt and ArAct2Af-bin.

SEQ ID NO: 30 describes dicot transformation vector
pAct2-bin.

SEQ ID NO: 31 describes dicot transformation vector
pArAct2Af-bin.

SEQ ID NO: 32 describes dicot transformation vector
pAfAct2Af-bin.

SEQ ID NO: 33 describes the 3' MAR dimer in
pAfAct2Af-bin.

DETAILED DESCRIPTION OF THE INVENTION 
Eukaryotic nuclei are highly organized structures in
which the entire genetic information has to be accessible
in an orderly manner for replication, transcription and
other cellular events (Lewin, 1994; Dillon and Grosveld,
1994; Jackson, 1995; Wolffe, 1994). Genes are typically
organized in chromatin loops of various sizes that are
attached to the proteinaceous nuclear matrix at locations
known as matrix attachment regions (MARs). MARs are often
located in non-transcribed regions of genes and are
thought to form the physical boundaries of individual DNA
loops. In several cases, MARs were shown to reduce
position effect in transgenic organisms. The chicken
lysozyme MAR was shown to increase expression, reduce
variance and make expression of an adjacent gene copy
number dependent in stably transfected cells (Stief et
al., 1989), in transgenic mice (Bonifer et al., 1990;
McKnight et al., 1992) and in transgenic tobacco plants
(Mlynarova et al., 1994; Mlynarova et al., 1995).

However, not all MARs have these effects on gene
expression. Two minimal Drosophila MARs (one located
between the histone H1 and H3 genes, and the other near
the heat shock HSP70 genes) stimulated expression more
than 10-fold in stably transformed cells, but the
presence of these MARs did not reduce position effect
(Poljak et al., 1994). MARs from the apolipoprotein
domain increased expression and reduced position effect
in low-copy transformants, but expression in multicopy
transformants was strongly repressed (Kalos and
Fournier, 1995). When a Drosophila ftz MAR was placed in
a different chromosomal location, it did not reorganize
chromatin structure and the chromatin fragment containing
the MAR could be easily eluted from the nucleus,
indicating that introduced MARs do not necessarily form
chromatin domains (Eggert and Jack, 1991). In contrast,
MARs flanking the immunoglobulin μ heavy chain locus
enhancer were required for high levels of expression and
the formation of an extended DNase I sensitive domain in
transgenic B lymphocytes, but not in stably transfected
tissue culture cells (Forrester et al., 1994).

Results using MARs in transgenic plants have been
similarly complex (Spiker and Thompson, 1996). A yeast
MAR increased expression levels in stably transformed
tobacco callus lines, but no correlation between copy
number and expression level could be found (Allen et al.,
1993). In contrast, the MAR element from the soybean heat
shock gene Gmhsp17.6-L was shown to be capable of
increasing expression levels but had little effect on
variability (Schoffl et al., 1993). A soybean MAR
flanking a reporter gene construct reduced variability of
expression when compared to a construct lacking MARs, but
also reduced expression levels when present 5' and 3' of
a reporter gene construct in transgenic tobacco callus
(Breyne et al., 1992). It is possible that individual
MARs can have different functional and structural
properties in addition to their matrix binding ability
(Breyne et al., 1994).

MARs are usually 300 to 2000 base pairs in length,
are rich in adenosine and thymine residues and often
contain certain conserved sequence elements and
structural features. Most MARs described in the
literature are not obtained from plants, but it has been
well documented that MARs from other organisms bind plant
scaffolds and vice versa (Dietz et al., 1994; Breyne et
al., 1992).

Table 1 describes sequence elements present in MARs
described in the literature, including the following
plant MARs: soybean heat shock protein gene MAR (Schoffl
et al., 1993); a petunia MAR (Dietz et al., 1994); the
pea plastocyanin gene MAR (Slatter et al., 1991); the
maize Adh1 gene 5' and 3' MARs (Avramova and Bennetzen,
1993; Avramova et al., 1995); the β-phaseolin gene 5' and
3' MARs (van der Geest et al., 1994).

WO 00/32800                                   PCT/US99/28123

Table 1

element: ARBP
characteristics: ARBP (attachment-region binding protein) from
    chicken, which binds MARs from Drosophila, mouse, chicken and
    human genes in a cooperative manner (von Kries et al., 1991;
    Buhrmester et al., 1995)
    sequence              SEQ ID NO:   reported in plant MARs
    ATTTCASTTGTAAAA       2            no
    TGCAGGTGTCCTT         3            yes
    TGGGGGTGTAAAA         4            yes

element: ATF
characteristics: ATF sequences bind transcription factors of the
    ATF family, two of which were shown to be integral components
    of the nuclear matrix (Stein et al., 1991)
    sequence              SEQ ID NO:   reported in plant MARs
    TGACGTCCATG           5            no

element: BEAF-32
characteristics: BEAF-32 (boundary element-associated factor of
    32 kDa molecular mass) binds to the locus boundary element
    scs' from Drosophila, which can insulate reporter genes from
    position effect variegation (Zhao et al., 1995)
    sequence              SEQ ID NO:   reported in plant MARs
    CGATA                 6            yes

element: topoisomerase II
characteristics: Topoisomerase II is a major component of the
    nuclear matrix and topoisomerase II binding sites have been
    found in most MARs to date (Sander and Hsieh, 1985;
    Boulikas, 1995)
    sequence              SEQ ID NO:   reported in plant MARs
    CNNGYNGKTNYNY         7            yes
    A ^MATC^rr? YWYATCRT  8            no
    GTNWAKATTNATNNR       9            yes

element: unwinding sequence
characteristics: Unwinding sequences such as AATATATTT cause DNA
    unwinding in vivo, which is important for MAR function
    (Bode et al., 1992)
    sequence              SEQ ID NO:   reported in plant MARs
    AATATATTT             10           yes

element: SATB1
characteristics: SATB1 is a protein isolated from human thymus
    tissue that selectively binds MAR sequences consisting of
    A's, T's and C's in one strand (Dickinson et al., 1992;
    Nagagomi et al., 1994)
    sequence              SEQ ID NO:   reported in plant MARs
    TTCTAATATAT           11           no
    ATAATCTTC             12           no
    TTATTATTTA            13           yes
    TATAAAAA              14           yes
    AAGATTATATA           15           no
    TTTTAATGAGATAATAA     16           no
    TATAATCTTC            17           no

element: bending DNA
characteristics: Curved DNA regions are often found in or near
    MARs (Bode et al., 1995)
    sequence              SEQ ID NO:   reported in plant MARs
    AAANNNNNNNAAA         18           yes

element: stem-loop
characteristics: Stem loops create small single-stranded regions
    which are important for MAR function (Boulikas and Kong, 1993)
    sequence              SEQ ID NO:   reported in plant MARs
    (none listed)                      yes

element: oligo A/T tracts
characteristics: These can create bent DNA that may attract
    protein complexes involved in topoisomerization,
    recombination, transcription or replication (Travers, 1990)
    sequence              SEQ ID NO:   reported in plant MARs
    AAAAAAA               19           yes

K: G or T, M: A or C, N: A, C, G or T, R: A or G, S: C or
G, W: A or T, Y: C or T.
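The degenerate-base legend under Table 1 can be applied mechanically when looking for these elements in a candidate sequence. The following is a minimal Python sketch, not part of the specification: it assumes only the single-letter codes given in the legend, and the helper names `motif_to_regex` and `find_motif` are illustrative.

```python
import re

# Degenerate-base codes exactly as listed in the legend under Table 1.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
    "R": "[AG]", "S": "[CG]", "W": "[AT]", "Y": "[CT]",
}

def motif_to_regex(motif):
    """Translate a degenerate motif (e.g. 'ATTTCASTTGTAAAA') into a regex."""
    return "".join(IUPAC[base] for base in motif.upper())

def find_motif(seq, motif):
    """Return 1-based start positions of every (possibly overlapping) match."""
    # A lookahead is used so that overlapping occurrences are all reported.
    pattern = re.compile("(?=" + motif_to_regex(motif) + ")")
    return [m.start() + 1 for m in pattern.finditer(seq.upper())]

# Toy sequences (hypothetical, for illustration only).
arbp_hits = find_motif("GGATTTCACTTGTAAAAGG", "ATTTCASTTGTAAAA")
beaf_hits = find_motif("CGATACGATA", "CGATA")
```

The sketch scans one strand only; a fuller search would also scan the reverse complement.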

The present invention utilizes a subset of the
features described in Table 1 in a novel artificial MAR.
The sequence of the 327 bp artificial MAR is given in SEQ
ID NO: 1. The artificial MAR was designed as a sequence
flanked by BglII and BamHI restriction sites, which are
included in SEQ ID NO: 1, but which are not critical to
the function of the MAR. The functional portion of the
MAR comprises bp 11 to 309 of SEQ ID NO: 1.
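Because the positions above are 1-based and inclusive, the functional portion (bp 11 to 309) spans 309 - 11 + 1 = 299 bp of the 327 bp sequence. A small Python sketch of that coordinate convention follows; the placeholder string stands in for SEQ ID NO: 1, which is not reproduced here, and `subregion` is an illustrative helper name.

```python
def subregion(seq, start, end):
    """Return bases start..end of seq, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]

# Placeholder of the stated length; NOT the actual SEQ ID NO: 1.
placeholder_mar = "N" * 327
functional = subregion(placeholder_mar, 11, 309)
functional_length = len(functional)
```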

The following features are found in SEQ ID NO: 1:

Feature                  location (bp)
BglII                    5-10
BEAF-32                  11-15
SATB1                    24-34
unwinding                28-36
topoisomerase II         44-59
ATF site                 60-69
A/T tract                70-85
BEAF-32                  86-90
stem-loop                93-101/117-124
unwinding                105-113
topoisomerase II         125-139
SATB1                    149-159
unwinding                153-161
BEAF-32                  164-168
SATB1                    185-195
stem-loop                208-216/231-239
unwinding                219-227
A/T tract                241-253
topoisomerase II         268-283
curved (bending) DNA     284-294
ARBP site                295-309
BamHI                    318-323
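The feature map above lends itself to a simple tabulation. The sketch below, offered only as an illustration, encodes the listed locations as data, counts the elements that lie wholly within the functional portion (bp 11 to 309), and doubles those counts for a tandem dimer. Treating the dimer as exactly twice the monomer is an assumption of the sketch, and the names `FEATURES` and `count_elements` are hypothetical.

```python
from collections import Counter

# (feature, [(start, end), ...]) taken from the feature list for SEQ ID NO: 1.
FEATURES = [
    ("BglII", [(5, 10)]),
    ("BEAF-32", [(11, 15)]),
    ("SATB1", [(24, 34)]),
    ("unwinding", [(28, 36)]),
    ("topoisomerase II", [(44, 59)]),
    ("ATF site", [(60, 69)]),
    ("A/T tract", [(70, 85)]),
    ("BEAF-32", [(86, 90)]),
    ("stem-loop", [(93, 101), (117, 124)]),
    ("unwinding", [(105, 113)]),
    ("topoisomerase II", [(125, 139)]),
    ("SATB1", [(149, 159)]),
    ("unwinding", [(153, 161)]),
    ("BEAF-32", [(164, 168)]),
    ("SATB1", [(185, 195)]),
    ("stem-loop", [(208, 216), (231, 239)]),
    ("unwinding", [(219, 227)]),
    ("A/T tract", [(241, 253)]),
    ("topoisomerase II", [(268, 283)]),
    ("curved (bending) DNA", [(284, 294)]),
    ("ARBP site", [(295, 309)]),
    ("BamHI", [(318, 323)]),
]

FUNCTIONAL_REGION = (11, 309)

def count_elements(features, region):
    """Count features whose spans all lie inside region (1-based, inclusive)."""
    lo, hi = region
    counts = Counter()
    for name, spans in features:
        if all(lo <= start and end <= hi for start, end in spans):
            counts[name] += 1
    return counts

monomer_counts = count_elements(FEATURES, FUNCTIONAL_REGION)
# Assumption: a tandem dimer carries two copies of every monomer element.
dimer_counts = Counter({name: 2 * n for name, n in monomer_counts.items()})
```

The flanking restriction sites fall outside bp 11 to 309 and are therefore excluded from the counts.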



The 3' UTR, or 3' untranslated region, that is
employed in constructs of the invention is one that
confers efficient processing of the mRNA, maintains
stability of the message and directs the addition of
adenosine ribonucleotides to the 3' end of the
transcribed mRNA sequence. The 3' UTR may be native
with the promoter region, native with the structural
gene, or may be derived from another source. Suitable 3'
UTRs include, but are not limited to: the per5 3' UTR,
and the 3' UTR of the nopaline synthase (nos) gene.

EXAMPLE 1
Synthesis of artificial MAR
To construct the artificial MAR, six individual
oligonucleotides were synthesized and assembled by PCR.
The sequences for the six oligonucleotides, referred to
hereinafter as MAR-A, MAR-B, MAR-C, MAR-D, MAR-E, and
MAR-F, are given in the Sequence Listing as SEQ ID NOS:
20 through 25, respectively. A 15 bp overlap between
adjacent oligonucleotides allowed assembly of the MAR by
PCR, using the strategy shown in Figure 1.
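The arithmetic behind overlap assembly can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the procedure of Figure 1: all fragments are written on the same strand (in PCR assembly adjacent oligonucleotides normally lie on alternating strands), the toy fragments are invented, and the equal oligonucleotide length of 67 nt is hypothetical, chosen only because six such oligonucleotides with five 15 bp overlaps give the stated 327 bp.

```python
def merge_by_overlap(fragments, overlap=15):
    """Join same-strand fragments whose ends share an exact overlap."""
    assembled = fragments[0]
    for frag in fragments[1:]:
        if assembled[-overlap:] != frag[:overlap]:
            raise ValueError("adjacent fragments do not overlap as expected")
        # Append only the part of the fragment beyond the shared overlap.
        assembled += frag[overlap:]
    return assembled

def assembled_length(lengths, overlap=15):
    """Length of the product: total length minus one overlap per junction."""
    return sum(lengths) - overlap * (len(lengths) - 1)

# Toy fragments with a 3-base overlap (hypothetical sequences).
toy = merge_by_overlap(["ACGTAC", "TACGGA", "GGATTT"], overlap=3)

# Hypothetical equal-length oligonucleotides; the real lengths are those of
# SEQ ID NOS: 20 through 25, which are not reproduced here.
full_length = assembled_length([67] * 6, overlap=15)
```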

The GeneAmp™ PCR Reagent Kit with AmpliTaq DNA
Polymerase (Perkin Elmer, Norwalk, CT) was used for the
DNA amplification. Twenty cycles of PCR (denaturation:
30 sec at [illegible]; annealing: 60 sec at 52°C; and extension:
60 sec at 70°C) with primers MAR-C and MAR-D were followed
by 20 cycles of PCR with primers MAR-B and MAR-E, using
the product from the first reaction as template for the
second reaction. The 231 bp product of this reaction was
purified from a low melting point agarose gel and used as
a template for 20 cycles of PCR with primers MAR-A and
MAR-F. The 327 bp product from this reaction was
subcloned into pCR2.1 using the TA Cloning™ Kit
(Invitrogen, San Diego, CA) and the sequence was verified
by sequencing.

A dimer consisting of two tandem copies of the
BglII/BamHI fragment was constructed in the BamHI site of
pBluescript™ SK- (Stratagene, La Jolla, CA). The
sequence of the dimer is bp 5-630 of SEQ ID NO: 26.

EXAMPLE 2

Binding of artificial MAR to nuclear scaffolds

A. Controls

Two DNA fragments of similar size and nucleotide
composition as the artificial MAR were amplified from
plant DNA to serve as controls in the binding assay.
These fragments were a 657 bp fragment from the 3' end of
a maize gene Gpa1 (glyceraldehyde-3-phosphate
dehydrogenase subunit A, GenBank accession number X15408,
bases 4516 to 5173, Quigley et al., 1989), and a 488 bp
fragment from the 5' flanking region of a 19 kD alpha
zein gene (GenBank accession number X05911, bases 339 to
827, Kriz et al., 1987). Table 2 compares the features
present in the artificial MAR and control fragments.

Table 2

element                          Artificial    Gpa1       zein
                                 MAR dimer     control    control
ARBP sites                       2             0          0
ATF sites                        2             0          0
BEAF-32 sites                    6             1          1
topoisomerase II sites           6             0          0
unwinding sequence sites         8             0          0
SATB1 sites                      6             0          0
bending DNA sites                yes           yes        yes
stem-loop sites                  4             1          0
oligo A/T tracts                 2             0          1
fragment size in binding assay   632           657        488
% A + T                          71            63         66
strength of binding to
nuclear scaffolds                +++++
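The "% A + T" row of Table 2 is a simple base-composition figure. A minimal Python sketch of how such a value could be computed is given below; the function name `at_percent` and the toy sequence are illustrative and are not taken from the specification.

```python
def at_percent(seq):
    """Percentage of A and T bases in a DNA sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    at_bases = seq.count("A") + seq.count("T")
    return 100.0 * at_bases / len(seq)

# Toy sequence (hypothetical); 4 of its 6 bases are A or T.
toy_at = at_percent("AATTGC")
```

Applied to the 632 bp dimer fragment and the two control fragments, a calculation of this kind would be expected to give the percentages reported in the table.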







B. Preparation of nuclei from maize leaves

Nuclei for use in isolating nuclear scaffolds were
prepared from young maize leaves by adaptation of a published
protocol (Hall et al., 1991). Nuclei were counted and checked
for integrity by microscopic examination of DAPI stained
aliquots. Only high quality nuclei were used to prepare
nuclear scaffolds by lithium diiodosalicylate extraction.

For nuclei purification, young maize leaves from V4 stage
plants (fourth or fifth leaf) were harvested with a razor
blade, washed and dried. After removing the midrib, leaves
were frozen in liquid nitrogen, and ground to a fine powder
with a mortar and pestle. The powdered leaf samples were
transferred to a glass beaker, and 5 ml NIB1+PI (0.5 M
hexylene glycol, 20 mM piperazine-N,N'-bis[2-ethanesulfonic
acid] (PIPES), pH 6.5, 20 mM KCl, 7 mM 2-mercaptoethanol, 0.5
mM ethylenediaminetetraacetic acid (EDTA), 0.4% Triton X-100™
[Rohm & Haas Company, Philadelphia, PA], 0.05 mM spermine,
0.125 mM spermidine, 1 mM phenylmethylsulfonyl fluoride
(PMSF), 1 µg/ml leupeptin, 1 µg/ml aprotinin) was added per
gram of leaf tissue. The leaf extract was filtered
sequentially through 1900, 520, 125, 85 and 40 µm filters at
4°C and filters were rinsed with 1 ml NIB1+PI per gram leaf to
collect any nuclei that were trapped in the debris. Fifteen ml
crude nuclear extract was loaded onto Percoll™ (Pharmacia
Biotech, Piscataway, NJ) gradients consisting of 7 ml 40%
Percoll in NIB1 (0.5 M hexylene glycol, 20 mM PIPES, pH 6.5,
20 mM KCl, 7 mM 2-mercaptoethanol, 0.5 mM EDTA, 0.4% Triton
X-100™, 0.05 mM spermine, 0.125 mM spermidine) and 5 ml 70%
Percoll in NIB1. After centrifugation for 15 min at 500xg at
4°C the 40%/70% interface was collected with a sterile pasteur
pipette and added to 2 volumes NIB2 (0.5 M hexylene glycol, 20
mM PIPES, pH 6.5, 20 mM KCl, 7 mM 2-mercaptoethanol, 0.5 mM
EDTA, 0.05 mM spermine, 0.125 mM spermidine), taking care to
avoid the pellet and other debris. Nuclei were concentrated by
centrifugation at 600xg for 10 min at 4°C.
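The centrifugation steps are specified as relative centrifugal force (xg), which must be converted to a rotor speed for a given centrifuge. The sketch below uses the standard relation RCF = 1.118e-5 x radius (cm) x rpm^2; the 10 cm rotor radius is a hypothetical value, since the specification does not name a rotor.

```python
import math

# Standard conversion constant for radius in cm and speed in rpm.
RCF_CONSTANT = 1.118e-5

def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) at a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2

def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given RCF at a given radius."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))

# Hypothetical rotor radius of 10 cm; substitute the radius of the rotor in use.
ASSUMED_RADIUS_CM = 10.0
rpm_for_500g = rpm_from_rcf(500, ASSUMED_RADIUS_CM)
rpm_for_600g = rpm_from_rcf(600, ASSUMED_RADIUS_CM)
```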

The nuclear pellet was resuspended in 20 ml NIB2 and
centrifuged as before. This step was repeated one more time to
wash away traces of Percoll. Nuclei were counted using a
hemacytometer and resuspended in NIB2+PI/50% glycerol (0.5 M
hexylene glycol, 20 mM PIPES, pH 6.5, 20 mM KCl, 7 mM
2-mercaptoethanol, 0.5 mM EDTA, 0.05 mM spermine, 0.125 mM
spermidine, 50% glycerol, 1 mM PMSF, 1 µg/ml leupeptin, 1
µg/ml aprotinin) at 20 million nuclei/ml. Nuclei were stored
at -80°C until used for scaffold preparation.



C. Preparation of nuclear scaffolds

Frozen nuclei were thawed and washed with 10 ml of
NIB3+PI (0.5 M hexylene glycol, 20 mM PIPES, pH 6.5, 20 mM
KCl, 7 mM 2-mercaptoethanol, 0.05 mM spermine, 0.125 mM
spermidine, 1 mM PMSF, 1 µg/ml leupeptin, 1 µg/ml aprotinin)
per 20 million nuclei. Nuclei were collected by centrifugation
at 600xg for 10 minutes, resuspended in 200 ml NIB3+PI in the
presence of 1 mM CuSO4, and incubated for 10 min at 42°C to
stabilize the nuclei.

Histones were extracted by incubation in 10 ml HIB+PI
(20 mM HEPES, pH 7.4, 100 mM lithium acetate, 10 mM LIS
(lithium diiodosalicylate), 0.1% digitonin, 2 mM EDTA, 1 mM
PMSF, 1 µg/ml leupeptin, 1 µg/ml aprotinin) for 15 minutes at
room temperature. The resulting nuclear halos were transferred
to a centrifuge tube and pelleted at 4000xg for 10 minutes.
Halos were washed twice with 10 ml HWB (20 mM Tris, pH 8, 70
mM NaCl, 20 mM KCl, 7 mM 2-mercaptoethanol, 0.1% digitonin,
0.05 mM spermine, 0.125 mM spermidine) and once with D/BB+PI
(HWB + 10 mM MgCl2, 1 mM PMSF, 1 µg/ml leupeptin, 1 µg/ml
aprotinin) to remove LIS. If halos did not pellet well,
subsequent centrifugation steps were done at 6000xg using a
slow brake setting. The quality of the halos was verified by
SDS-PAGE gel to ensure that more than 95% of the histones were
removed in the extraction procedure.

Washed nuclear halos were resuspended in 400 µl D/BB+PI
and 200 units of restriction enzymes (100 u each of EcoRI and
HindIII) were added, and incubated at 37°C for 2-3 hours on a
rocking platform to keep the halos from settling. The
restriction enzymes removed more than 70% of the nuclear DNA,
producing nuclear scaffolds. Scaffolds were pelleted at 300xg
and washed with HWB+PI (HWB + 1 mM PMSF, 1 µg/ml leupeptin, 1
µg/ml aprotinin). Nuclear scaffolds were resuspended in 400 µl
HWB+PI and separated into 100 µl aliquots (containing 5
million nuclear equivalents).
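The aliquot figures above are internally consistent with the 20 million nuclei used per preparation, as the following short Python check illustrates. It assumes no loss of material between the starting nuclei and the final scaffold suspension; `aliquot_plan` is an illustrative helper name.

```python
def aliquot_plan(total_volume_ul, aliquot_volume_ul, nuclei_per_aliquot):
    """Return (number of aliquots, total nuclear equivalents, equivalents per ul)."""
    n_aliquots = int(total_volume_ul // aliquot_volume_ul)
    total_equivalents = n_aliquots * nuclei_per_aliquot
    per_ul = nuclei_per_aliquot / aliquot_volume_ul
    return n_aliquots, total_equivalents, per_ul

# 400 ul of scaffolds split into 100 ul aliquots of 5 million equivalents each.
n_aliquots, total_equivalents, per_ul = aliquot_plan(400, 100, 5_000_000)
```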

D. Binding of artificial MAR to nuclear scaffolds

100 µl aliquots of scaffolds in HWB+PI were incubated
with probe and E. coli competitor DNA at 37°C for 2-3 hours in
siliconized microfuge tubes on a rocking platform shaker.
After incubation, the supernatant fraction (containing unbound
DNA fragments) and pellet fraction (containing scaffolds and
bound DNA fragments) were separated via centrifugation in a
horizontal microfuge at 3000xg for 5 min. The pellet was
washed once with 200 µl HWB to remove proteinase inhibitors,
resuspended in 100 µl lysis buffer (10 mM Tris, pH 8, 10 mM
EDTA, 0.5% SDS, 0.5 mg/ml Proteinase K) and incubated
overnight at room temperature.

Equal fractions of the pellet and supernatant were
separated on a 0.9% agarose gel, which was subsequently fixed
by soaking in 7% TCA for 20 min, dried and exposed to X-ray
film at room temperature and/or storage phosphor screens for
the PhosphorImager™ SI (Molecular Dynamics, Sunnyvale, CA).

Plasmids containing the artificial MAR monomer or dimer
or the control Gpa1 or zein sequences were digested with
restriction enzymes that generate 5' overhang ends. The
Klenow fragment of DNA polymerase I was used to fill the
overhang with [α-32P]dCTP (Amersham Life Science, Arlington
Heights, IL). The end-labeled DNA fragments were used as
probes in the binding assay, i.e. the fragments were incubated
with purified maize nuclear scaffolds in the presence of
unlabeled E. coli competitor DNA and the relative binding of
the inserts was determined. Relative amounts of nuclei, probe
and unlabeled E. coli competitor DNA used in the binding assay
were optimized to obtain maximal discrimination between
strongly and weakly binding MARs. The optimal relative
amounts were 2, 5 or 10 µg of unlabeled E. coli competitor
DNA, 5 million nuclear equivalents of nuclear scaffolds, and 1
fmole of digested and labeled plasmid per assay.

The artificial MAR dimer bound very strongly to the
nuclear scaffold preparation, even in the presence of high
levels of competitor DNA. The monomer MAR also bound to
nuclear scaffold preparations, albeit at a lower affinity.
Neither control sequence was retained in the pellet fraction,
even though they were similar to the artificial MARs in size
and relative AT content. This suggests that the elements
included in the artificial MAR facilitate binding.

EXAMPLE 3

EVALUATION OF THE ARTIFICIAL MAR IN RICE
A. Rice Transformation Vectors

pGOS2-hpt (SEQ ID NO: 27) is a rice transformation vector
containing a hygromycin selectable marker driven by the 35S
promoter and a GOS2/GUS/nos cassette (GOS2 transcription
initiation region/GUS structural gene/nos 3' untranslated
region). The GOS2 transcription initiation region in this
construct is comprised of 1010 bp of promoter and 170 bp of
untranslated 5' leader interrupted by a 1100 bp intron (de
Pater et al., 1992).

pArGOS2Af-hpt (SEQ ID NO: 28) is a rice transformation
vector identical to pGOS2-hpt except that it has the MAR dimer
of SEQ ID NO: 29 positioned 5' to the GOS2 transcription
initiation region and the MAR dimer of SEQ ID NO: 26 positioned
3' to the nos 3' UTR.

A schematic representation of the ArGOS2Af construct is
shown in FIG. 2.

B. Transformation of Rice

For initiation of embryogenic callus, mature seeds of a
Japonica cultivar, Taipei 309, were dehusked and
surface-sterilized in 70% ethanol for 5-7 min, followed by soaking
30-45 min in 25% commercial bleach (2.6% sodium hypochlorite)
with 0.02% Tween™ 20 (ICI Americas, Inc.) under vacuum. The
seeds were then rinsed 5 times in sterile distilled water and
placed on filter paper before transferring to induction media
(NB). The NB medium consisted of N6 macro elements (Chu,
1978), B5 micro elements and vitamins (Gamborg et al., 1968),
300 mg/l casein hydrolysate, 500 mg/l L-proline, 500 mg/l
L-glutamine, 30 g/l sucrose, 2 mg/l 2,4-dichlorophenoxyacetic
acid (2,4-D), and 2.5 g/l Gelrite™ (Merck & Co., Rahway, NJ)
with the pH adjusted to 5.8. The mature seed cultured on
induction media were incubated in the dark at 28°C for three
weeks. Primary callus induced from the scutellar region of
mature embryo was transferred to fresh NB medium for further
maintenance and thereafter maintained on a two week subculture
period.

To prepare DNA for blasting, about 140 µg of plasmid DNA
(pGOS2-hpt or pArGOS2Af-hpt) was precipitated onto 60 mg of
gold particles. The plasmid DNA was precipitated onto 1.5-3.0
micron (Aldrich Chemical Co., Milwaukee, WI) or 1.0 micron
gold particles (Bio-Rad Laboratories, Hercules, CA). The
precipitation mixture included 60 mg of pre-washed gold
particles, 300 µl of water/DNA (140 µg), 74 µl of 2.5 M CaCl2,
and 30 µl of 0.1 M spermidine. After adding the components in
the above order, the mixture was vortexed immediately, and
allowed to settle for 2-3 min. The supernatant was pipetted
off and discarded. The DNA-coated gold particles were
resuspended in 1 ml of 100% ethanol and diluted to 17.5 µg
DNA/7.5 mg gold per ml of ethanol for use in blasting
experiments.
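The dilution figures follow from the starting quantities: 60 mg of gold at 7.5 mg per ml implies a final volume of 8 ml, and 140 µg of DNA in 8 ml is 17.5 µg per ml. A short Python sketch of that arithmetic is given below; it assumes that all of the DNA stays bound to the gold when the supernatant is discarded, and `gold_suspension` is an illustrative name.

```python
def gold_suspension(dna_ug, gold_mg, target_gold_mg_per_ml):
    """Return (final volume in ml, DNA concentration in ug/ml)."""
    final_volume_ml = gold_mg / target_gold_mg_per_ml
    # Assumption: no DNA is lost with the discarded supernatant.
    dna_ug_per_ml = dna_ug / final_volume_ml
    return final_volume_ml, dna_ug_per_ml

final_volume_ml, dna_ug_per_ml = gold_suspension(140, 60, 7.5)
```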

For helium blasting, actively growing embryogenic callus
cultures, 2-4 mm in size, were subjected to a high osmoticum
treatment by placing callus on NB medium with 0.2 M mannitol
and 0.2 M sorbitol (Vain et al., 1993) for 4 hr before helium
blasting. Following osmoticum treatment, callus cultures were
transferred to blasting medium (NB+2% agar) and covered with a
stainless steel screen (230 micron). Helium blasting involved
accelerating the suspended DNA-coated gold particles towards
and into the prepared tissue targets. The device used was an
earlier prototype to the one described in US Patent No.
5,141,131, which is incorporated herein by reference, although
both function in a similar manner. The callus cultures were
blasted at different helium pressures (1,750-2,250 psi) one to
three times per target. After blasting, callus was
transferred back to the high osmotic media overnight before
placing on selection medium, which consisted of NB medium with
30 mg/l hygromycin. After 2 weeks, the cultures were
transferred to fresh selection medium with higher
concentrations of selection agent, i.e., NB+50 mg/l hygromycin
(Li et al., 1993).

Compact, white-yellow, embryogenic callus cultures,
recovered on NB+50 mg/l hygromycin, were regenerated by
transferring to pre-regeneration (PR) medium+50 mg/l
hygromycin. PR medium consisted of NB medium with 2 mg/l
benzyl aminopurine (BAP), 1 mg/l naphthalene acetic acid
(NAA), and 5 mg/l abscisic acid (ABA). After 2 weeks of
culture in the dark, they were transferred to regeneration
(RN) medium. The composition of RN medium is NB medium with 3
mg/l BAP, and 0.5 mg/l NAA. The cultures on RN medium were
incubated for 2 weeks at 28°C under high fluorescent light
(325 ft-candles). The plantlets with 2 cm shoots were
transferred to 1/2 MS medium (Murashige and Skoog, 1962) with
1/2 B5 vitamins, 10 g/l sucrose, 0.05 mg/l NAA, 50 mg/l
hygromycin and 2.5 g/l Gelrite™ adjusted to pH 5.8 in GA7
vessels (Magenta Corp., Chicago, IL). When plantlets were
established with well-developed root systems, they were
transferred to soil (1 part Metro-Mix 360 (Scotts-Sierra
Horticultural Products Co., Marysville, OH) and 1 part top
soil) and raised in a growth chamber (29/24°C day/night cycle,
50-60% humidity, 12 h photoperiod) until they reached a height
of 60 cm, at which point 2 leaves were harvested for
quantitative GUS analysis, and the plants were transferred to
the greenhouse to grow to maturity.

C. Southern analyses

Southern analysis was used to identify primary regenerant
(R0) rice lines that contained intact copies of the
specific gene construct.

A DNA probe specific for the coding region of the
glucuronidase (GUS) gene construct was gel purified with the
Qiaex II DNA purification kit (Qiagen Inc., Chatsworth, CA).
Radiolabeled probe was prepared using the Ready-To-Go™ DNA
labeling beads (Pharmacia LKB, Piscataway, NJ) with 50
microcuries of [α-32P]dCTP (Amersham Life Science, Arlington
Heights, IL).

Leaf material from R0 rice plants was harvested from two
representatives from each line. Genomic DNA from the R0 plants
was prepared from lyophilized tissue as described by
Saghai-Maroof et al. (1984).

Four micrograms of rice DNA was digested with restriction
enzyme to release the intact gene construct using conditions
suggested by the manufacturer (Bethesda Research Laboratory,
Gaithersburg, MD) and separated by agarose gel
electrophoresis. The DNA was blotted onto nylon membranes as
described by Southern (1975, 1989). Radiolabeled probe DNA
was hybridized to the genomic DNA on the blots using 50 ml of
minimal hybridization buffer (10% polyethylene glycol, 7%
sodium dodecyl sulfate, 0.6x SSC, 10 mM sodium phosphate, 5 mM
EDTA and 100 mg/ml denatured salmon sperm DNA) heated to 60°C
and mixed with the denatured radiolabeled probe prior to
being added to the blots for overnight hybridization at 60°C.
The blots were washed at 60°C in 0.25x SSC and 0.2% SDS for 45
minutes, blotted dry and exposed to XAR-5 film with two
intensifying screens overnight.

Southern analysis was conducted on seventy ArGOS2Af R0
rice lines. The DNA from the R0 plants was digested with the
restriction enzyme XbaI which, if the intact gene construct is
present, should result in a 5.7 kb hybridization product when
hybridized with a radiolabeled probe specific for the GUS
coding region. The 5.7 kb fragment should consist of the
artificial MAR in the reverse orientation, the GOS2 promoter,
the GUS coding region, the nos 3' UTR and the artificial MAR
in the forward orientation. The expected 5.7 kb hybridization
product was detected in twenty-five of the seventy rice lines.
All of the twenty-five lines had multiple hybridization
products and two of the lines had identical complex
hybridization patterns, indicating that they are probably from
the same transformation event.

The non-MAR control lines, GOS2, were also analyzed by
Southern analysis. The DNA from forty-eight GOS2 R0 lines was
digested with the restriction enzymes EcoRI and XbaI which, if
the intact gene is present, should result in a 4.4 kb
hybridization product when hybridized with a radiolabeled
probe specific for the GUS coding region. The 4.4 kb fragment
would include 1.6 kb of the GOS2 promoter, the GUS coding
region, the nos 3' UTR and the 35S promoter (the promoter used
to drive the selectable marker gene). The expected 4.4 kb
hybridization product was detected in twenty-eight of the
forty-eight GOS2 lines. Two of the lines had identical
hybridization patterns and must have resulted from the same
transformation event. Two of the lines contained genetic
chimeras.
D. GUS analysis

Analysis of rice was performed on young leaves of primary
transformants, after plants had been grown 6-8 weeks in an
environmentally controlled growth chamber and had reached a
height of about 60 cm. Two independently regenerated rice
plants were analyzed per transformation event. Individual
transformants were analyzed by Southern blots to verify the
presence of an intact copy of the transgene and determine
whether each event displayed unique hybridization patterns,
indicating independent transformation events. Plants lacking a
complete copy of the transgene, chimeric events, or duplicated
integration events were not included in the analysis.

Results of the analysis are reported in FIGS 4 and 5.
In FIG 4, error bars represent the standard deviation between
the plants for each transformation event. Two samples were
independently processed for each plant. In general, the level
of expression of GUS in independent rice plants from each
transformation event was similar (as demonstrated by the
standard deviation of the results shown in FIG 4).

FIG 5 reports the percent of transformation events
expressing GUS in the indicated ranges.



EVALUATION OF ARTIFICIAL MAR IN ARABIDOPSIS 
A. Arabidopsis Transformation Vectors 
Act2/GUS/nos (Act2 transcription initiation region/GUS
structural gene/nos 3' UTR) constructs were made for testing
in a dicot system (Arabidopsis). Three vectors were made:

pAct2-bin (SEQ ID NO: 30) is a binary vector containing an
Act2/GUS/nos cassette, 19S/NPTII/orf25 polyA as a selectable
marker, and 35S/GFP/nos as an independent reporter gene.

pArAct2Af-bin (SEQ ID NO: 31) is identical to pAct2-bin
except that it has the MAR dimer of SEQ ID NO: 29 positioned 5'
to the Act2 transcription initiation region, and the MAR dimer
of SEQ ID NO: 26 positioned 3' to the nos 3' UTR.

pAfAct2Af-bin (SEQ ID NO: 32) is identical to pAct2-bin
except that it has the MAR dimer of SEQ ID NO: 26 positioned 5'
to the Act2 transcription initiation region, and the MAR dimer
of SEQ ID NO: 33 positioned 3' to the nos 3' UTR.

These vectors enabled testing of two orientations of the
artificial MAR dimer in Arabidopsis. A schematic of the
pArAct2Af-bin and pAfAct2Af-bin constructs is shown in Figure
3.



B. Arabidopsis transformation

Arabidopsis transformation was performed according to a
protocol provided by Pam Green (van Hoof and Green 1996),
which is an adaptation from protocols by Nicole Bechtold
(Bechtold et al., 1993), Andrew Bent (Bent et al., 1994) and
Takashi Araki (personal communication).

Seeds of ecotype Columbia were planted in 4 inch square
pots, covered with window screen mesh, and grown under
conditions of 16 hours light/8 hours dark at 22°C, fertilizing
by subirrigation once a week. The fertilizer consisted of 5
mM KNO3, 2.5 mM KPO4 (pH 5.5), 2 mM MgSO4, 2 mM Ca(NO3)2, 0.05
mM Fe·EDTA, 0.07 mM boric acid, 0.014 mM MnCl2, 0.005 mM CuSO4,
0.001 mM ZnSO4, 0.0002 mM NaMoO4, and 0.01 mM NaCl. Plants
were thinned to 4 plants per pot and grown until several bolts
emerged. When plants were ready to transform, the above-soil
parts were submerged in infiltration medium (2.2 g/l MS salts,
1X B5 vitamins, 50 g/l sucrose, 2.5 mM MES, pH 5.7, 0.044 µM
benzylaminopurine, 200 µl/l Silwet L-77™ [OSi Specialties,
Inc.]) containing Agrobacterium cells and placed inside a vacuum
desiccator under a vacuum of 400 mm Hg (about 17 inches) for 5
minutes. After quickly releasing the vacuum, pots were
drained and placed on their sides in a tray covered with
plastic wrap to maintain humidity for 24 hours. The next day
the pots were uncovered and set upright. Plants were staked
individually and after 2 weeks watering was gradually reduced
to allow plants to dry out. Seeds were harvested from each
plant individually.

For selection of transformation events, 1-10 mg seeds per
plant were surface sterilized by soaking in 10% bleach for 7
minutes while mixing vigorously, followed by three rinses in
sterile water, and placed in a flask containing Arabidopsis
germination medium (MS salts, MS vitamins, 10% sucrose, 2.5 mM
2-[N-morpholino]ethanesulfonic acid [MES], 30 mg/l kanamycin,
50 mg/l vancomycin and 0.1% Bacto™-Agar [Difco Laboratories,
Detroit, MI]). After shaking in continuous light at 90 rpm
for 3 days, seeds germinated, and transformants were isolated
as green seedlings between 7 and 12 days after germination.
Nontransformed seeds produced small bleached seedlings.
Transformants were transferred to solid medium (MS salts, B5
vitamins, 10% sucrose, 2.5 mM MES, 15 g/l Phytagar™ [Gibco
BRL, Gaithersburg, MD], 30 mg/l kanamycin, 50 mg/l vancomycin)
in plates for further selection. After one to two weeks, true
transformants were transferred to GA7 vessels (MS salts, B5
vitamins, 0.3% sucrose, 2.5 mM MES) for one to two weeks
prior to planting in soil for production of T1 seed.



C. Southern analyses 

Southern analysis was used to identify primary regenerate
T2 Arabidopsis lines that contained intact copies of the
specific gene construct.

Pooled samples of Arabidopsis leaf tissue were powdered
in liquid nitrogen. The ground tissue was then incubated for
three minutes in 500 µl 2X extraction buffer (2% CTAB, 100 mM
Tris-HCl, pH 8.0, 20 mM EDTA, 1.4 M NaCl and 2%
2-mercaptoethanol) at 65°C. Five hundred µl of
chloroform/octanol (24:1) was added, the samples were shaken
for two minutes and then spun at 14,000 x g in a
microcentrifuge, and the supernatant was removed. The
chloroform/octanol extraction was repeated. One ml of
precipitation buffer (1% CTAB, 50 mM Tris-HCl, pH 8.0, 10 mM
EDTA, and 1% 2-mercaptoethanol) was added to the supernatant
and then incubated at room temperature for 60 minutes. The
DNA was pelleted by centrifugation at 3500 x g for 5 minutes in
a microcentrifuge. The pellet was drained and resuspended in
200 µl 1.0 M NH4OAc. One hundred µl 7.5 M NH4OAc and 1 ml
isopropanol were added, the samples incubated on ice for 5
minutes and then centrifuged at 14,000 x g for 5 minutes. The
pellet was drained and resuspended in 200 µl TE. One hundred µl
7.5 M NH4OAc and 1 ml isopropanol were added and incubated on
ice for 5 minutes, then centrifuged at 14,000 x g for 5 minutes.
The pellet was drained, rinsed with 70% ethanol and dried in a
Speed Vac (Savant Instruments Inc., Farmingdale, NY). The
dried pellet was resuspended in 20 µl TE (10 mM Tris, 1 mM
EDTA, pH 8.0).

Southern analysis was conducted on 29 ArAct2Af T2 lines
and 24 AfAct2Af T2 lines. One microgram of DNA from the
ArAct2Af plants was digested with the restriction enzyme XbaI
using conditions suggested by the manufacturer (Bethesda
Research Laboratory, Gaithersburg, MD) and separated by
agarose gel electrophoresis, which should result in a 4.6 kb
hybridization product, if the gene construct is intact, when
hybridized with a radiolabeled probe specific for the GUS
coding region. Similar to the ArGOS2Af rice plants, the 4.6 kb
fragment should consist of the artificial MAR in the reverse
orientation, the Act2 promoter, the GUS coding region, the nos
3' UTR and the artificial MAR in the forward orientation. The
DNA was blotted onto nylon membranes as described by Southern
(1975, 1989). Radiolabeled probe DNA was hybridized to the
genomic DNA on the blots using 50 ml of minimal hybridization
buffer (10% polyethylene glycol, 7% sodium dodecyl sulfate,
0.6X SSC, 10 mM sodium phosphate, 5 mM EDTA and 100 µg/ml
denatured salmon sperm DNA) heated to 60°C and mixed with
the denatured radiolabeled probe prior to being added to the
blots for overnight hybridization at 60°C. The blots were
washed at 60°C in 0.25X SSC and 0.2% SDS for 45 minutes,
blotted dry and exposed to XAR-5 film with two intensifying
screens overnight.

The expected 4.6 kb hybridization product was detected in
twenty-six of the twenty-nine ArAct2Af lines. A second
Southern blot was generated to determine the copy number of
the ArAct2Af construct. The ArAct2Af DNA was digested with
the restriction enzymes SstI and XhoI, which cut the construct
near the right and left borders of the T-DNA. The blots were
hybridized with radiolabeled probes specific for the artificial
MAR and the DNA from the left border to the XhoI site 800 bp
downstream. A single copy of the ArAct2Af construct will have
three hybridization products: two fragments of unknown size,
consisting of the left and right border DNA, and the 8.9 kb
fragment which is the DNA internal to the borders. Twenty-two
of the twenty-nine lines had three or fewer hybridization
products, indicating that a single copy of the ArAct2Af
construct was present.

The DNA from the AfAct2Af plants was digested with the
restriction enzyme XbaI which, if the construct remains
intact, should result in a 5.7 kb hybridization product when
hybridized with a radiolabeled probe specific for the GUS
coding region. The 5.7 kb fragment should consist of the
artificial MAR in the forward orientation, the Act2 promoter,
the GUS coding region, the nos 3' UTR and the artificial MAR in
the forward orientation. It also includes the green fluorescent
protein coding region and the nos 3' UTR. The expected 5.7 kb
hybridization product was detected in all of the twenty-four
AfAct2Af lines.

A second Southern blot was generated to determine the
copy number of the AfAct2Af construct. The AfAct2Af DNA was
digested with the restriction enzymes SstI and XhoI, which cut
the construct near the right and left borders of the T-DNA.
The blots were hybridized with radiolabeled probes specific for
the artificial MAR and the DNA from the left border to the XhoI
site 800 bp downstream. A single copy of the AfAct2Af
construct will have three hybridization products: two
fragments of unknown size, consisting of the left and right
border DNA, and the 8.9 kb fragment which is the DNA internal
to the borders. Fifteen of the twenty-four lines had three or
fewer hybridization products, indicating that a single copy of
the AfAct2Af construct was present.

The non-MAR control lines, Actbin, were also analyzed by
Southern analysis. The DNA from thirty-four Actbin T2 lines
was digested with the restriction enzyme PstI which should, if
the construct remains intact, result in a 3.4 kb hybridization
product when hybridized with a radiolabeled probe specific for
the GUS coding region. The 3.4 kb fragment should consist of
the Act2 promoter, the GUS coding region, and the nos 3' UTR.
The expected 3.4 kb hybridization product was detected in thirty
of the thirty-four Actbin lines. A second Southern blot was
generated to determine the copy number of the Actbin
construct. The DNA from the Actbin lines was digested with
the restriction enzymes SstI and XhoI, which cut the construct
near the right and left borders of the T-DNA. The blots were
hybridized with radiolabeled probes specific for the nos 3' UTR
and the DNA from the left border to the XhoI site 800 bp
downstream. A single copy of the Actbin construct will have
three hybridization products: two fragments of unknown size,
consisting of the left and right border DNA, and the 8.2 kb
fragment which is the DNA internal to the borders. Nine of
the thirty-four lines had three or fewer hybridization
products, indicating that a single copy of the Actbin construct
was present.
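
The copy-number criterion used for all three constructs (three or fewer hybridization products on the SstI/XhoI blot is read as a single copy) can be written down as a small rule and applied to the counts reported above. This is an illustrative sketch only; the function and variable names are hypothetical, and the band counts passed to the function in the usage lines are made-up inputs, not data from the blots.

```python
# Sketch of the single-copy criterion described for the SstI/XhoI blots.
# Names are hypothetical; the (single_copy, total) counts are those reported in the text.
SINGLE_COPY_MAX_BANDS = 3  # two border fragments plus one internal fragment


def is_single_copy(n_bands):
    """Apply the text's rule: three or fewer hybridization products."""
    return 1 <= n_bands <= SINGLE_COPY_MAX_BANDS


# construct: (lines scored as single copy, lines analyzed), as reported above
reported = {
    "ArAct2Af": (22, 29),
    "AfAct2Af": (15, 24),
    "Actbin": (9, 34),
}

fractions = {name: single / total for name, (single, total) in reported.items()}

for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of lines scored as single copy")

# Usage with hypothetical band counts:
print(is_single_copy(3))  # a line showing three bands
print(is_single_copy(6))  # a line showing six bands
```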

D. GUS analysis 

For growing Arabidopsis, T2 seed was germinated in vitro
on MS medium containing 90 mg/l kanamycin and 3 week old
kanamycin resistant seedlings were harvested. Two batches of
30 seedlings per transformation event were used for GUS
analysis and additional seedlings were used to extract DNA.

For analyses of GUS activity, leaf samples were powdered
in liquid nitrogen and samples of approximately 400 µl of
tissue were placed in microfuge tubes. Two independent
samples from each leaf sample were processed. The tissue was
either stored at -70°C or extracted immediately. GUS was
extracted by mixing the powdered tissue with GUS lysis buffer
(Jefferson et al., 1987) modified by the addition of 1%
polyvinylpolypyrrolidone (hydrated in the buffer for at least
one hour) and 20% glycerol. After incubation on ice for at
least 10 min, the samples were centrifuged at 16,000 x g for 10
min. The supernatants were recovered and centrifuged a second
time as described above. The supernatants were recovered,
frozen on dry ice and stored at -70°C. Experiments showed that
GUS activity was stable for at least 4 freeze-thaw cycles when
stored in the buffer described above (W.M. Ainley,
unpublished). GUS activity was measured using a GUS-Light™
kit (Tropix, Inc., Bedford, MA). Five µl samples of undiluted
extract, or of extract diluted so that the luminescence was
within the range measured by the luminometer, were added to 195
µl of the GUS-Light™ Reaction Buffer. Luminescence was
integrated for 5 sec after a 5 sec delay. Protein was
measured with the assay developed by Bradford (1976) using
human serum albumin as the standard. GUS activity was
normalized between experiments using a GUS standard obtained
from SIGMA. The amount of plant protein in the standards and
plant samples was the same; protein was adjusted where
necessary using extracts of nontransformed plants.

FIGS 6 and 7 report the results of the GUS analysis. FIG 6
reports expression observed for independent transformation
events expressing either the base, non-MAR construct (Actbin)
or the base construct flanked by the artificial MAR in the
indicated orientation (ArAct2Af or Af-Act-Af). Standard
deviations indicate the variance between plants harvested from
different plates. Arrows indicate the relative orientations
of the MARs.

In FIG 7, the percent of transformation events expressing GUS
in the indicated ranges is shown.

E. Characterization of transgenic plants expressing the 
reporter gene constructs 
T2 Arabidopsis plants were analyzed. The plants were
evaluated for segregation of kanamycin resistance to determine
insert copy number. A few events did not fall into either the
one- or two-insert category as determined by chi-square
analyses. Of the remaining plants, 72% had a single insert.
Most of the inserts had multiple copies of the transgene.
Although the Arabidopsis transformation events were grown under
controlled environments, there were occasional differences in
the relative growth of plants between the duplicate plates. In
those cases, the events were either not used or were grown
again. The coefficient of variance of the expression determined
from duplicate plates ranged between 1 and 46%, with 89 percent
below 20 percent (Figure 6).
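
The chi-square classification of events into one- or two-insert categories can be sketched as follows. The text does not state the expected ratios or the significance threshold, so this sketch assumes the usual Mendelian expectations for a dominant marker in a selfed progeny (3:1 resistant:sensitive for one insert, 15:1 for two unlinked inserts) and the 5% critical value for one degree of freedom (3.841). The seedling counts in the usage lines are hypothetical.

```python
# Sketch: chi-square goodness-of-fit test for kanamycin-resistance segregation.
# ASSUMPTIONS (not stated in the text): 3:1 for one insert, 15:1 for two
# unlinked inserts, and a 5% significance threshold with 1 degree of freedom.
CRITICAL_0_05_DF1 = 3.841


def chi_square(observed, expected_ratio):
    """Chi-square statistic of observed counts against an expected ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def classify_inserts(resistant, sensitive):
    """Return 'one insert', 'two inserts', or 'unresolved' for one event."""
    counts = (resistant, sensitive)
    fits_one = chi_square(counts, (3, 1)) <= CRITICAL_0_05_DF1
    fits_two = chi_square(counts, (15, 1)) <= CRITICAL_0_05_DF1
    if fits_one and not fits_two:
        return "one insert"
    if fits_two and not fits_one:
        return "two inserts"
    return "unresolved"


# Usage with hypothetical seedling counts (resistant, sensitive):
print(classify_inserts(75, 25))
print(classify_inserts(94, 6))
print(classify_inserts(50, 50))
```

Events that fit neither ratio, or that cannot be distinguished between them, fall into the "unresolved" group, which corresponds to the few events the text describes as not falling into either category.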

Generally, all Arabidopsis transformation events tested
for a construct were grown and analyzed together. Although
the growth of the plants was under controlled environments,
the possibility remained that the differences observed between
transformation events expressing the constructs were due to
environmental differences occurring between different
experiments. To eliminate this possibility, three of the
highest expressing events from the three sets of Arabidopsis
transformation events were grown together and reanalyzed.
This analysis confirmed the earlier differences between the
sets of transformation events.



SUMMARY OF EFFECTS OF THE ARTIFICIAL MAR ON TRANSGENE
EXPRESSION

In both rice and Arabidopsis, transformation events
expressing the MAR-containing constructs expressed, on
average, at a higher level than those lacking the MAR elements
(Table 3). In Arabidopsis, orientation of the MARs tested
influenced the level of expression. Plants containing
constructs in which the MARs were in the same orientation on
either side of the GUS gene construct expressed GUS at higher
levels than plants containing constructs in which the MARs
were in opposite orientations.

There appeared to be a proportionally higher expression
level in the higher expressors than in the lower expressors in
each set of transformation events. To document this,
expression of the upper quartile of the expressors was
compared (Table 3). Based on these data, the upper quartile of
the transformation events expressing the AfAct2Af construct
produced GUS protein at levels 5.6 times higher than the
upper quartile of events expressing the Actbin construct. The
artificial MARs in opposite orientations in both rice and
Arabidopsis enhanced expression approximately two-fold over the
respective constructs lacking MARs.



Table 3 

Comparison of the expression of transformation events expressing either the 
base constructs or base constructs flanked by the artificial MARs. 





CONSTRUCT                      GOS2          ArGOS2Af      ActBin         Ar-Act-Af      Af-Act-Af

species transformed            rice          rice          Arabidopsis    Arabidopsis    Arabidopsis
method of transformation       particle      particle      Agrobacterium  Agrobacterium  Agrobacterium
                               bombardment   bombardment
number of transformation
events analyzed                28            25            28             21             22
average                        357           713           2,313          3,673          9,042
median                         95            317           2,205          2,772          5,730
upper quartile range           258-3,076     1,674-2,661   3,047-5,311    6,240-10,614   15,595-27,179
upper quartile average         1,169         2,099         4,056          7,812          22,688
relative average expression    1.0           2.0           1.0            1.6            4.0
relative upper quartile
average expression             1.0           1.8           1.0            1.9            5.6

Averages for the upper quartile of transformation event expression levels were
statistically different (p is less than or equal to 0.05) based on t-test
analyses (Gopal K. Kanji 1995).
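
The two "relative" rows of Table 3 follow from dividing each construct's value by that of the non-MAR base construct for the same species. The short sketch below recomputes them from the averages printed in the table; the dictionary layout and names are illustrative. Because the inputs are the rounded table values, a recomputed ratio can differ in the last digit from the tabulated one (for Af-Act-Af, 9,042/2,313 comes out near 3.9 where the table lists 4.0).

```python
# Sketch: recompute the relative expression rows of Table 3 from the tabulated averages.
average = {"GOS2": 357, "ArGOS2Af": 713, "ActBin": 2313,
           "Ar-Act-Af": 3673, "Af-Act-Af": 9042}
upper_quartile_average = {"GOS2": 1169, "ArGOS2Af": 2099, "ActBin": 4056,
                          "Ar-Act-Af": 7812, "Af-Act-Af": 22688}

# Each construct is compared with the non-MAR base construct of its species.
baseline = {"GOS2": "GOS2", "ArGOS2Af": "GOS2", "ActBin": "ActBin",
            "Ar-Act-Af": "ActBin", "Af-Act-Af": "ActBin"}


def relative(values):
    """Ratio of each construct to its base construct, rounded to one decimal."""
    return {name: round(values[name] / values[baseline[name]], 1)
            for name in values}


relative_average = relative(average)
relative_upper_quartile = relative(upper_quartile_average)

for name in average:
    print(name, relative_average[name], relative_upper_quartile[name])
```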

Previously published studies (reviewed by Holmes-Davis and
Comai, 1998) have shown that, with the exception of one MAR,
all MARs tested to date enhance expression of reporter genes.
This study represents the first report that a MAR constructed
using elements found preferentially in MARs can enhance
expression in plant species representing both monocotyledonous
and dicotyledonous plants.

1. An isolated DNA molecule comprising bp 11 to 309 of SEQ
ID NO: 1.

2. A DNA construct comprising, in the 5' to 3' direction: a
transcription initiation region functional in plant cells, a
structural gene operatively associated with the transcription
initiation region, a 3' untranslated region, and a matrix
attachment region comprised of bp 11 to 309 of SEQ ID NO: 1
positioned either 5' to said transcription initiation region
or 3' to said structural gene.

3. A DNA construct of claim 2 wherein said matrix attachment
region comprises two or more tandem copies of bp 11 to 309 of
SEQ ID NO: 1.

4. A DNA construct of claim 2 wherein a first matrix
attachment region comprised of bp 11 to 309 of SEQ ID NO: 1 is
positioned 5' to said transcription initiation region and a
second matrix attachment region comprised of bp 11 to 309 of
SEQ ID NO: 1 is positioned 3' to said structural gene.

5. A DNA construct of claim 4 wherein each of said matrix
attachment regions is comprised of two or more tandem copies
of bp 11 to 309 of SEQ ID NO: 1.

6. A method of making recombinant plant cells having
increased expression of structural genes introduced therein
which comprises:
transforming a plant cell capable of regeneration with a DNA
construct of claim 2.

7. A method of claim 6 wherein the matrix attachment region
in said DNA construct comprises two or more tandem copies of
bp 11 to 309 of SEQ ID NO: 1.

8. A method of claim 6 wherein the DNA construct includes a
first matrix attachment region comprised of bp 11 to 309 of
SEQ ID NO: 1 positioned 5' to said transcription initiation
region and a second matrix attachment region comprised of bp
11 to 309 of SEQ ID NO: 1 positioned 3' to said structural
gene.

9. A method of claim 8 wherein said first and second matrix
attachment regions are each comprised of two or more tandem
copies of bp 11 to 309 of SEQ ID NO: 1.

10. A transformed plant cell containing DNA of bp 11 to 309
of SEQ ID NO: 1.

SEQUENCE LISTING

<110> van der Geest, Apolonia H. M.
Ainley, W. Michael
Cowen, Neil W.
Welter, Mary E.
Woosley, Aaron T.

<120> ARTIFICIAL MATRIX ATTACHMENT REGION FOR INCREASING
EXPRESSION OF GENES INTRODUCED IN PLANT CELLS

<130> 50545

<140>
<141>

<150> US 60/110,437
<151> 1998-12-01

<160> 33

<170> PatentIn Ver. 2.0

<210> 1 
<211> 327 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: artificial MAR 
<400> 1 

ttccagatct cgatatcttt aatttctaat atatttagaa ggggtttaga tttatatatg 60 
acgtccatga aaaaaaattt taaaacgata ggccagctcc aaagaatata tttccctgga 120 
gctggtaaat attaattagt cctctccctt ctaatatatt tttcgatatt tttgattcct 180 
ttttaagatt atatagctcc atgccaagct gacttcctaa tatattttat gaagtcagca 240 
aaatttttaa aaagcacact tgacttggta tatatttata aatgtttaaa cttaatttca 300 
cttgtaaaac tcttgcagga tccgtgc 327 
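
The region recited throughout the claims, bp 11 to 309 of SEQ ID NO: 1, can be pulled out of the listed sequence programmatically. The sketch below is illustrative only; variable names are hypothetical. It converts the 1-based, inclusive coordinates used in the text to a 0-based Python slice and also checks that the MAR-A oligonucleotide of SEQ ID NO: 20 matches the first 54 bases of SEQ ID NO: 1 as listed.

```python
# Sketch: extract bp 11 to 309 (1-based, inclusive) from SEQ ID NO: 1 as listed above.
SEQ_ID_NO_1 = "".join("""
ttccagatct cgatatcttt aatttctaat atatttagaa ggggtttaga tttatatatg
acgtccatga aaaaaaattt taaaacgata ggccagctcc aaagaatata tttccctgga
gctggtaaat attaattagt cctctccctt ctaatatatt tttcgatatt tttgattcct
ttttaagatt atatagctcc atgccaagct gacttcctaa tatattttat gaagtcagca
aaatttttaa aaagcacact tgacttggta tatatttata aatgtttaaa cttaatttca
cttgtaaaac tcttgcagga tccgtgc
""".split())

# MAR-A oligonucleotide, SEQ ID NO: 20 as listed
MAR_A = "".join(
    "ttccagatct cgatatcttt aatttctaat atatttagaa ggggtttaga ttta".split()
)


def subsequence(seq, first_bp, last_bp):
    """Return bases first_bp..last_bp using 1-based, inclusive coordinates."""
    return seq[first_bp - 1:last_bp]


mar_core = subsequence(SEQ_ID_NO_1, 11, 309)

print(len(SEQ_ID_NO_1))                   # total length of the listed sequence
print(len(mar_core))                      # length of the bp 11-309 region
print(mar_core[:20])                      # first bases of the region
print(SEQ_ID_NO_1.startswith(MAR_A))      # MAR-A covers the 5' end of SEQ ID NO: 1
```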

<210> 2 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ARBP site 
<400> 2 

atttcasttg taaaa 15 

<210> 3 
<211> 13 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: ARBP site 
<400> 3 

tgcaggtgtc ctt 13 

<210> 4
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ARBP site 
<400> 4 

tgggggtgta aaa 13 

<210> 5 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ATF site 
<400> 5 

tgacgtccat g 11 

<210> 6 
<211> 5 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BEAF-32 site 
<400> 6 

cgata 5 

<210> 7
<211> 13
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: topoisomerase
II site
<400> 7

cnngyngktn yny 13

<210> 8
<211> 16
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: topoisomerase
II site
<400> 8

asmatgcgyw yatcrt 16


WO 00/32800 PCT/US99/28123

<210> 9
<211> 15
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: topoisomerase
II site
<400> 9

gtnwakattn atnnr 15
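
Several of the motifs in this listing (for example the topoisomerase II sites of SEQ ID NOS: 7 to 9) use IUPAC ambiguity codes such as n, w, k, r and y. One way to look for such sites in a candidate sequence is to translate the motif into a regular expression. The sketch below is a hedged illustration, not the method used by the inventors; the function names are hypothetical and the target sequence in the usage lines is made up.

```python
import re

# Standard IUPAC nucleotide ambiguity codes, lower case as in the listing.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "[ag]", "y": "[ct]", "s": "[cg]", "w": "[at]",
    "k": "[gt]", "m": "[ac]", "b": "[cgt]", "d": "[agt]",
    "h": "[act]", "v": "[acg]", "n": "[acgt]",
}


def iupac_to_regex(motif):
    """Translate an IUPAC motif (spaces ignored) into a regex string."""
    return "".join(IUPAC[ch] for ch in motif.lower().replace(" ", ""))


def find_sites(motif, sequence):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    return [m.start() for m in pattern.finditer(sequence.lower())]


# Usage: SEQ ID NO: 9 motif against a hypothetical test sequence.
TOPO_II_SEQ_9 = "gtnwakattn atnnr"
print(find_sites(TOPO_II_SEQ_9, "ccgtcaatattcatgga"))
print(find_sites("cgata", "ttcgatacgata"))
```

The lookahead form `(?=(...))` is used so that overlapping occurrences are all reported rather than only non-overlapping ones.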



<210> 10 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: unwinding 
sequence 

<400> 10 

aatatattt 9 

<210> 11
<211> 11
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: SATB1 site
<400> 11

ttctaatata t 11

<210> 12
<211> 9
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: SATB1 site
<400> 12

ataatcttc 9

<210> 13
<211> 10
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: SATB1 site
<400> 13

ttattattta 10

<210> 14
<211> 8
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: SATB1 site
<400> 14

tataaaaa 8

<210> 15 
<211> 11 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SATB1 site

<400> 15 

aagattatat a 11 

<210> 16 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SATB1 site

<400> 16 

ttttaatgag ataataa 17 

<210> 17 

<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SATB1 site
<400> 17 

tataatcttc 10 

<210> 18 
<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: bending DNA 
site 

<400> 18 

aaannnnnnn aaa 13 

<210> 19 
<211> 7 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: exemplary 
oligo A/T tract 

<400> 19 

aaaaaaa 7

<210> 20
<211> 54
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MAR-A 
oligonucleotide 

<400> 20 

ttccagatct cgatatcttt aatttctaat atatttagaa ggggtttaga ttta 54 

<210> 21 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MAR-B 
oligonucleotide 

<400> 21 

gaaggggttt agatttatat atgacgtcca tgaaaaaaaa ttttaaaacg ataggccagc 60 
tcc 63

<210> 22 
<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MAR-C 
oligonucleotide 

<400> 22 

cgataggcca gctccaaaga atatatttcc ctggagctgg taaatattaa ttagtcctct 60 
ccc 63 

<210> 23 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: MAR-D 
oligonucleotide 

<400> 23 

cttggcatgg agctatataa tcttaaaaag gaatcaaaaa tatcgaaaaa tatattagaa 60 
gggagaggac taatt 75 

<210> 24 
<211> 74 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: MAR-E 
oligonucleotide 

<400> 24 

ccaagtcaag tgtgcttttt aaaaattttg ctgacttcat aaaatatatt aggaagtcag 60 
cttggcatgg agct 74



<210> 25 
<211> 74 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: MAR-F 
oligonucleotide 



<400> 25 

gcacggatcc tgcaagagtt ttacaagtga aattaagttt aaacatttat aaatatatac 60 
caagtcaagt gtgc 74 



<210> 26
<211> 668
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: 3' MAR in
ArActAf
<400> 26

gggggatctc gatatcttta atttctaata tatttagaag gggtttagat ttatatatga 60
cgtccatgaa aaaaaatttt aaaacgatag gccagctcca aagaatatat ttccctggag 120
ctggtaaata ttaattagtc ctctcccttc taatatattt ttcgatattt ttgattcctt 180
tttaagatta tatagctcca tgccaagctg acttcctaat atattttatg aagtcagcaa 240
aatttttaaa aagcacactt gacttggtat atatttataa atgtttaaac ttaatttcac 300
ttgtaaaact cttgcaggat ctcgatatct ttaatttcta atatatttag aaggggttta 360
gatttatata tgacgtccat gaaaaaaaat tttaaaacga taggccagct ccaaagaata 420
tatttccctg gagctggtaa atattaatta gtcctctccc ttctaatata tttttcgata 480
tttttgattc ctttttaaga ttatatagct ccatgccaag ctgacttcct aatatatttt 540
atgaagtcag caaaattttt aaaaagcaca cttgacttgg tatatattta taaatgttta 600
aacttaattt cacttgtaaa actcttgcag gatccactag ttctagagcg gccgccaccg 660
cggtggag 668
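
SEQ ID NO: 26 is described as a MAR dimer, and the claims refer to tandem copies of bp 11 to 309 of SEQ ID NO: 1. A simple consistency check is to count how many times that 299 bp region occurs in the dimer. The sketch below is illustrative only: the sequences are typed in from the listing with the column layout restored, and the variable names are hypothetical.

```python
# Sketch: count occurrences of bp 11-309 of SEQ ID NO: 1 within SEQ ID NO: 26.
def clean(text):
    """Join a block-formatted sequence into one continuous string."""
    return "".join(text.split())


SEQ_ID_NO_1 = clean("""
ttccagatct cgatatcttt aatttctaat atatttagaa ggggtttaga tttatatatg
acgtccatga aaaaaaattt taaaacgata ggccagctcc aaagaatata tttccctgga
gctggtaaat attaattagt cctctccctt ctaatatatt tttcgatatt tttgattcct
ttttaagatt atatagctcc atgccaagct gacttcctaa tatattttat gaagtcagca
aaatttttaa aaagcacact tgacttggta tatatttata aatgtttaaa cttaatttca
cttgtaaaac tcttgcagga tccgtgc
""")

SEQ_ID_NO_26 = clean("""
gggggatctc gatatcttta atttctaata tatttagaag gggtttagat ttatatatga
cgtccatgaa aaaaaatttt aaaacgatag gccagctcca aagaatatat ttccctggag
ctggtaaata ttaattagtc ctctcccttc taatatattt ttcgatattt ttgattcctt
tttaagatta tatagctcca tgccaagctg acttcctaat atattttatg aagtcagcaa
aatttttaaa aagcacactt gacttggtat atatttataa atgtttaaac ttaatttcac
ttgtaaaact cttgcaggat ctcgatatct ttaatttcta atatatttag aaggggttta
gatttatata tgacgtccat gaaaaaaaat tttaaaacga taggccagct ccaaagaata
tatttccctg gagctggtaa atattaatta gtcctctccc ttctaatata tttttcgata
tttttgattc ctttttaaga ttatatagct ccatgccaag ctgacttcct aatatatttt
atgaagtcag caaaattttt aaaaagcaca cttgacttgg tatatattta taaatgttta
aacttaattt cacttgtaaa actcttgcag gatccactag ttctagagcg gccgccaccg
cggtggag
""")

mar_core = SEQ_ID_NO_1[10:309]  # bp 11 to 309, 1-based inclusive


def find_all(needle, haystack):
    """Return 0-based start positions of every occurrence of needle."""
    starts, pos = [], haystack.find(needle)
    while pos != -1:
        starts.append(pos)
        pos = haystack.find(needle, pos + 1)
    return starts


copy_starts = find_all(mar_core, SEQ_ID_NO_26)
print(len(SEQ_ID_NO_26), len(mar_core))
print(copy_starts)  # 0-based start of each copy in the dimer
```

Adding 1 to each reported start gives the 1-based position of the copy in SEQ ID NO: 26.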



<210> 27 
<211> 9361 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: pGOS2-hpt



<400> 27 

gshttcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg 60 

gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg 120 

ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat tgtactgaga 180 

gtgcaccata tgcggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg 240

cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 300 

ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 360 

gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattcccgat cgatctagta 420 

acatagatga caccgcgcgc gataatttat cctagtttgc gcgctatatt ttgttttcta 480 

tcgcgtatta aatgtataat tgcgggactc taatcataaa aacccatctc ataaataacg 540 

tcatgcatta catgttaatt attacatgct taacgtaatt caacagaaat tatatgataa 600 

tcatcgcaag accggcaaca ggattcaatc ttaagaaact ttattgccaa atgtttgaac 660 

gatcggggaa attcgagctc ttactattcc tttgccctcg gacgagtgct ggggcgtcgg 720 

tttccactat cggcgagtac ttctacacag ccatcggtcc agacggccgc gcttctgcgg 780 

gcgatttgtg tacgcccgac agtcccggct ccggatcgga cgattgcgtc gcatcgaccc 840 
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tgcgcccaag 

ccaatgcgga 

tcgaagtagc 

gcgacctcgt 

cggccattgt 

tcggggcagt 

acggtgt cgt 

aaatcacgcc 

gtctggctaa 

gcgggcagtt 

caataggtca 

gccgatgcaa 

acccgcagga 

tccgagagct 

gacgtcgcgg 

caaatggccc 

ctaaatcatc 

tgtgtggagc 

gtcagatcaa 

ctgacttggt 

aaatatcagc 

ccgtgttctc 

gtgggattgt 

agacgtggtt 

gggaccactg 

tttgtaggag 

atggaatccg 

catcaatcca 

gtgggtgggg 

cctttatcgc 

aagtgacaga 

aaagtctcca 

cgatcgat ct 

tattttgttt 

tctcataaat 

aaattatatg 

ccaaatgttt 

gacgatggtg 

ccgaagttca 

tcgcgagtga 

aaatcggcga 

tgatacatat 

tgcagcccgg 

tcctgccagg 

tggacatacc 

tcggtgtgag 

ttacgcgttg 

tcgttggcaa 

ttaatcgcct 

tctttcggct 

gcagtttcat 

taagggtaat 

tgcaccatca 

aagccagtaa 

gaccggatgc 

cacagttcat 

gtcccgctag 

ctgacatcac 

acatgcgtca 

ctgcgatgga 

ttttcgtcgg 



ctgcatcatc 
gcat atacgc 
gcgtctgctg 
attgggaatc 
ccgtcaggac 
cct cggccca 
ccat cacagt 
atgtagtgta 
gatcggccgc 
cggtttcagg 
ggctctcgct 
agtgccgata 
catatccacg 
gcatcaggtc 
tgagttcagg 
cagatccgtc 
aagatgcaaa 
agcaaaggag 
gagacagagg 
ggtgctggta 
tcctccgtgg 
tccaaatgaa 
gcgtcatccc 
ggaacgtctt 
tcggcagagg 
ccaccttcct 
aggaggtttc 
cttgctttga 
gtccatcttt 
aatgatggca 
tagctgggca 
cccatgcaga 
agtaacatag 
tctatcgcgt 
aacgtcatgc 
ataatcatcg 
gaacgatcgg 
cgccaggaga 
tgccagtcca 
agatcccttt 
aattccatac 
ccagccatgc 
ctaacgtatc 
ccagaagtt c 
atccgtaata 
cgtcgcagaa 
cttccgccag 
tactccacat 
gtaagtgcgc 
tgttgcccgc 
caatcaccac 
gcgaggtacg 
gcacgttatc 
agtagaacgg 
cgacgcgaag 
agagataacc 
tgccttgtcc 
cattggccac 
ccacggtgat 
ttccggcata 
taatcaccat 



gaaatt gccg 
ccggagccgc 
ctccatacaa 
cccgaacatc 
attgttggag 
aagcat cage 
tt gccagtga 
ttgaccgatt 
agcgatcgca 
caggtcttgc 
gaattcccca 
aacat aacga 
ccctcctaca 
ggagacgctg 
ctttttcata 
cgcagctgca 
ctttt cgccc 
cttt ccccaa 
agaaacaagg 
tattagggat 
actgccttgt 
atgaacttcc 
ttacgtcagt 
ctttttccac 
catcttcaac 
tttccactat 
cggatattac 
agacgtggtt 
gggaccactg 
tttgtaggag 
atggaatccg 
tctgcaggca 
atgacaccgc 
attaaatgt a 
attacatgtt 
caagaccggc 
ggaaattcga 
gttgttgatt 
gcgtttttgc 
cttgttaccg 
ctgttcaccg 
acactgatac 
cacgccgtat 
tttttccagt 
acggttcagg 
cattacattg 
tggcgaaata 
caccacgctt 
ttgctgagtt 
ttcgaaacca 
gatgccatgt 
gtaggagttg 
gaatcctttg 
tttgtggtta 
cgggtagata 
ttcacccggt 
agttgcaacc 
cacctgccag 
atcgtccacc 
gttaaagaaa 
tcccggcggg 



t caaccaagc 
ggcgat cctg 
gccaaccacg 
gcctcgctcc 
ccgaaatccg 
tcatcgagag 
tacacatggg 
cctrgcggtc 
tccatggcct 
aacgtgacac 
atgt caagca 
t ctttgtaga 
tcgaagctga 
tcgaactttt 
t ctcacgaga 
cgggt ccagg 
tt gctaaaca 
aattactcaa 
cggacctttg 
agggttgctc 
cgagccttca 
ttatatagag 
ggagatat ca 
gatgctcct c 
gatggccttt 
cttcacaata 
cctttgttga 
ggaacgtctt 
tcggcagagg 
ccaccttcct 
aggaggtttc 
tgcaagctgc 
gcgcgataat 
taattgcggg 
aattattaca 
aacaggattc 
gctctccaat 
cattgtttgc 
agcagaaaag 
ccaacgcgca 
acgacggcgc 
tcttcactcc 
tcggtgatga 
accttctctg 
cacagcacat 
acgcaggtga 
ttcccgtgca 
gggtggtttt 
tccccgttga 
atgcctaaag 
t cat ctgccc 
gccccaatcc 
ccacgtaagt 
at caggaact 
tcacactctg 
tgccagaggt 
acctgttgat 
tcaacagacg 
caggtgttcg 
tcatggaagt 
atagtctgcc 



tctgatagag
caagctccgg
gcctccagaa 
agtcaatgac 
cgtgcacgag 
cctgcgcgac 
gatcagcaat 
cgaatgggcc 
ccgcgaccgg 
cctgtgcacg 
cttccggaat 
aaccatcggc 
aagcacgaga 
cgatcagaaa 
tctgtatccc 
aaagcaatcg
cggtaaaatt 
cgaatcataa 
cacttgatct 
ctatccacag 
gctggatcct 
gaagggtctt 
catcaatcca 
gtgggtgggg 
cctttatcgc 
aagtgacaga 
aaagtctcca 
ctttttccac 
catcttcaac 
tttccactat 
cggatattac 
ggccgcgttt 
ttatcctagt 
actctaatca 
tgcttaacgt 
aatcttaaga 
tccccaccga 
ctccctgctg 
ccgccgactt 
atatgcctt g 
tgacgcgatc 
acatgtcggt 
taatcggctg 
ccgtttccaa 
caaagagatc 
tcggacgcgt 
cttgcggacg 
tgtcacgcgc 
ctgcctcttc 
agaggttaaa 
agtcgagcat 
agtccattaa 
ccgcatcttc 
gttcgccctt 
tctggctttt 
gcggattcac 
ccgcatcacg 
cgtggttaca 
gcgtggtgta 
aagactgctt 
agttcagttc 



ttggt caaga 
atgcctccgc 
gaagatgttg 
cgctgttatg 
gt gccggact 
ggacgcact g 
cgcgcatatg 
gaacccgctc 
ctgcagaaca 
gcgggagat g 
cgggagcgcg 
gcagctattt 
ttcttcgccc 
cttctcgaca 
gtgcctggaa 
catagtcaag 
cgaatggaca 
accaagatta 
ggggattgcc 
cttgtccacc 
ctagagtccc 
gcgaaggata 
cttgctttga 
gtccatcttt 
aatgatggca 
tagctgggca 
t cgatgatca 
gatgctcctc 
gatggccttt 
cttcacaata 
cctttgttga 
aaacaattcc 
ttgcgcgcta 
taaaaaccca 
aattcaacag 
aactttattg 
ggctgtagcc 
cggtttttca 
cggtttgcgg 
cgaggtcgca 
aaagacgcgg 
gtacattgag 
atgcagtttc 
at cgccgctt 
gctgatggta 
cgggtcgagt 
ggtatccggt 
tatcagctct 
gctgtacagt 
gccgacagca 
ctcttcagcg 
tgcgtggtcg 
atgacgacca 
cactgccact 
ggctgtgacg 
cacttgcaaa 
cagttcaacg 
gtcttgcgcg 
gagcattacg 
tttcttgccg 
gttgttcaca 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 
caaacggtga tacgtacact tttcccggca ataacatacg gcgtgacatc ggcttcaaat 4560

ggcgtatagc cgccctgatg ctccatcact tcctgattat tgacccacac tttgccgtaa 4620

tgagtgaccg catcgaaacg cagcacgata cgctggcctg cccaaccttt cggtataaag 4680

acttcgcgct gataccagac gttgcccgca taattacgaa tatctgcatc ggcgaactga 4740 

tcgttaaaac tgcctggcac agcaattgcc cggctttctt gtaacgcgct ttcccaccaa 4800

cgctgatcaa ttccacagtt ttcgcgatcc agactgaatg cccacaggcc gtcgagtttt 4860 

ttgatttcac gggttggggt ttctacagga cggaccatgg aactttgctg gtgaaagtgg 4920 

caagctacac caagaactgc acaaaggaaa tgaatcataa gaacaagcag tataactatc 4980 

ccgattacag cttcataaat ttctttctgt aatcaagcag tcaaggaata accaaaaaga 5040

aacttctaca ggtagataca agaggataat cgatagataa tcgatgtgaa tttgaaaaca 5100 

aaattgagga cagttcatgc cagactttca gctcagaata atgaaggggt gagagctaat 5160 

aaaaaaaata atccaaatga atactgctta tgctacagtt catttcatat aattaaaaat 5220

ggagatcaga aatcggataa gttcttctcc tgactaaact ataggggtat tacctattaa 5280 

ctgaactaca gctaggataa cgctataaaa gcattatttg tagcaatcaa ttcattgaac 5340 

tgaaccagaa agtgactttt taagatattt gggaaaaaaa attctgggac taaagcaaat 5400 

cggaagaaca gggaatcccc tgttcctgag gattataact gtttccatga atttcgtgat 5460

ggggactact gtatctaaac aagccagctg cgaaattttg gacaggctta agaatcaatc 5520 

attcaatctc atcaacggga ccgtcttcaa agttggatta tttttgttca atagggaata 5580

aacaaaggat agcttcgtca aatcgagaag catcactacc agaatcgagg accaaacaac 5640 

cgtactttta ttacaccaag caaaatcacc ggtgctctga ttttacctca aacaaaaggt 5700 

actcacaaaa tcgcaagatt ccgtacccta aaccatttca tttccataga gctctccaga 5760 

cgattgaaaa ccatactact aatcaaaccg aaccgataac atgcaacatc aagaacccct 5820 

ctatcccaaa tccaagaaca ggaatcatca cagatacaga tcccctttcc taacatcaac 5880

gcccgtacta cacaacctag aacaataaat ccaagaacaa ccgaagggca cataccctgt 5940

gaggaggagg tggaggaggg aagagatcga ccaagaactc gaccggaaga tatggatcga 6000 

agaaggcggt cgctcggctt ctgctagtcg cgtgtccttg gtgctctccc tcttcttggc 6060 

ttggatgcct cctatataga gaggggaaaa ggggggagga atttatagat gggaggagga 6120 

ggatgcttgg ttttctttgc ctctcctcct ctctcctggc cgcaaagcct gctgttaaaa 6180 

ggttgttgcg gacttgctgt cctccgttag atcatcgttt tttgtgggtt gttctgtggg 6240 

cagccactct gttgttgcct gtgtgcccaa tatgggagat tggcgctcgc gcacgagcaa 6300 

aattcttttt ttttttgtat gtgaaaaacc taaatagttc gtttcatact tttcatgcta 6360 

ttctgtaaaa ttattttttg tattttagat attattaaaa gtggtctggt gatggtggag 6420 

tgcttgaatt cagatattgc tacctaaatg tattaaatag gcgagtgata ttagagatgt 6480

gttgctagtt gaacgtgtat tgaggaggtg cactcacaca tgcacatgag cacaaagtgt 6540 

gtgcgtaagt accttgcatc taatcgaaaa aagataaata ataaaaataa gtcaattgtc 6600 

tttaattact aaataaaaat tgggatggag taagacatgt ccttaatgat gtgcgatatg 6660 

acgaatgcac aaactataaa attatataat tatatgttta aatctttgcc aatacgttca 6720

gaatatcttc attctatttt tttaaaaaaa atctctctct ttacccattg agttcagcta 6780 

gaaagatttt tttattcaag aagagtttga tgacaattat ggctacctcg aataatttaa 6840 

cttcatgaca gagatgtgaa cgtatattct aagcaataat gatttcattt tcccacttaa 6900

ttactaagga aaacgaaact agtgtagcga ctctttttta tttagcccga ttgccactaa 6960

agtaggtgga tgagtttttc ttgcatagtt ctagttatca gcgctacata tataaggtct 7020 

cattttatat ttagcacacg ttccctatat tgttagttag ggggtgaaaa cggtgcagaa 7080 

actgcaggca tgcaagcttc ccgggaatgc ggccgcgcgc cgcagctggc gtaatcatgg 7140 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 7200

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 7260 

ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 7320

ggccaacgcg cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact 7380

gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta 7440 

atacggttat ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag 7500 

caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 7560 

cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 7620

taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 7680

ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc 7740

tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 7800 

gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 7860

ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 7920 

aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 7980

aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 8040 

agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 8100 

cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 8160
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 8220

atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 8280

gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 8340

tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 8400

gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 8460

ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 8520

actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 8580

ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 8640

tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 8700

cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 8760

ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 8820

ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 8880

tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 8940

agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 9000

atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 9060

gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 9120

aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 9180

tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 9240

aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa 9300

gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt 9360

c 9361



<210> 28 
<211> 10629 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: pArGOS2Af-hpt



<400> 28 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cccgatcgat ctagtaacat 420

agatgacacc gcgcgcgata atttatccta gtttgcgcgc tatattttgt tttctatcgc 480

gtattaaatg tataattgcg ggactctaat cataaaaacc catctcataa ataacgtcat 540

gcattacatg ttaattatta catgcttaac gtaattcaac agaaattata tgataatcat 600

cgcaagaccg gcaacaggat tcaatcttaa gaaactttat tgccaaatgt ttgaacgatc 660

ggggaaattc gagctcttac tattcctttg ccctcggacg agtgctgggg cgtcggtttc 720

cactatcggc gagtacttct acacagccat cggtccagac ggccgcgctt ctgcgggcga 780

tttgtgtacg cccgacagtc ccggctccgg atcggacgat tgcgtcgcat cgaccctgcg 840

cccaagctgc atcatcgaaa ttgccgtcaa ccaagctctg atagagttgg tcaagaccaa 900

tgcggagcat atacgcccgg agccgcggcg atcctgcaag ctccggatgc ctccgctcga 960

agtagcgcgt ctgctgctcc atacaagcca accacggcct ccagaagaag atgttggcga 1020

cctcgtattg ggaatccccg aacatcgcct cgctccagtc aatgaccgct gttatgcggc 1080

cattgtccgt caggacattg ttggagccga aatccgcgtg cacgaggtgc cggacttcgg 1140

ggcagtcctc ggcccaaagc atcagctcat cgagagcctg cgcgacggac gcactgacgg 1200

tgtcgtccat cacagtttgc cagtgataca catggggatc agcaatcgcg catatgaaat 1260

cacgccatgt agtgtattga ccgattcctt gcggtccgaa tgggccgaac ccgctcgtct 1320

ggctaagatc ggccgcagcg atcgcatcca tggcctccgc gaccggctgc agaacagcgg 1380

gcagttcggt ttcaggcagg tcttgcaacg tgacaccctg tgcacggcgg gagatgcaat 1440

aggtcaggct ctcgctgaat tccccaatgt caagcacttc cggaatcggg agcgcggccg 1500

atgcaaagtg ccgataaaca taacgatctt tgtagaaacc atcggcgcag ctatttaccc 1560

gcaggacata tccacgccct cctacatcga agctgaaagc acgagattct tcgccctccg 1620

agagctgcat caggtcggag acgctgtcga acttttcgat cagaaacttc tcgacagacg 1680

tcgcggtgag ttcaggcttt ttcatatctc acgagatctg tatcccgtgc ctggaacaaa 1740

tggccccaga tccgtccgca gctgcacggg tccaggaaag caatcgcata gtcaagctaa 1800
atcatcaaga
tggagcagca 
gatcaagaga 
cttggtggtg 
atcagctcct 
gttctctcca 
gattgtgcgt 
gtggttggaa 
ccactgtcgg 
taggagccac 
aatccgagga 
aatccacttg 
gtgggggtcc 
tatcgcaatg 
gacagatagc 
tctccaccca 
tgcaagagtt 
gtgcttttta 
gctatataat 
aattaatatt 
ttttttttca 
gatatcgaga 
taccaagtca 
agcttggcat 
aagggagagg 
tatcgtttta 
attagaaatt 
tgacaccgcg 
ttaaatgtat 
ttacatgtta 
aagaccggca 
gaaattcgag 
ttgttgattc 
cgtttttgca 
ttgttaccgc 
tgttcaccga 
cactgatact 
acgccgtatt 
ttttccagta 
cggttcaggc 
attacattga 
ggcgaaatat 
accacgcttg 
tgctgagttt 
t cgaaaccaa 
atgccatgtt 
taggagttgg 
aatcctttgc 
ttgtggttaa 
gggtagatat 
tcacccggtt 
gttgcaacca 
acctgccagt 
tcgtccaccc 
ttaaagaaat 
cccggcggga 
tt cccggcaa 
tccatcactt 
agcacgatac 
ttgcccgcat 
gcaattgccc 



tgcaaacttt 
aaggagcttt 
cagaggagaa 
ctggtatatt 
ccgtggactg 
aatgaaatga 
catcccttac 
cgtcttcttt 
cagaggcatc 
cttccttttc 
ggtttccgga 
ctttgaagac 
atctttggga 
atggcatttg 
tgggcaatgg 
tgcagatctg 
ttacaagtga 
aaaattttgc 
cttaaaaagg 
taccagctcc 
tggacgtcat 
tcctgcaaga 
agtgtgcttt 
ggagctatat 
actaattaat 
aaattttttt 
aaagatatcg 
cgcgataatt 
aattgcggga 
attattacat 
acaggattca 
ctctccaatt 
attgtttgcc 
gcagaaaagc 
caacgcgcaa 
cgacggcgct 
cttcactcca 
cggtgatgat 
ccttctctgc 
acagcacatc 
cgcaggtgat 
tcccgtgcac 
ggtggttttt 
ccccgttgac 
tgcctaaaga 
catctgccca 
ccccaatcca 
cacgtaagtc 
tcaggaactg 
cacactctgt 
gccagaggtg 
cctgttgatc 
caacagacgc 
aggtgttcgg 
catggaagta 
tagtctgcca 
taacatacgg 
cctgattatt 
gctggcctgc 
aattacgaat 
ggctttcttg 



tcgcccttgc 
ccccaaaatt 
acaaggcgga 
agggataggg 
ccttgt cgag 
actt ccttat 
gtcagtggag 
ttccacgatg 
ttcaacgatg 
cactatcttc 
tattaccctt 
gtggttggaa 
ccactgtcgg 
taggagccac 
aatccgagga 
caggcatgca 
aattaagttt 
tgacttcata 
aatcaaaaat 
agggaaatat 
atataaatct 
gttttacaag 
ttaaaaattt 
aatcttaaaa 
atttaccagc 
tcatggacgt 
agatcgccca
tatcctagtt 
ctctaatcat 
gcttaacgta 
atcttaagaa 
ccccaccgag 
tccctgctgc 
cgccgacttc 
tatgccttgc 
gacgcgat ca 
catgtcggtg 
aatcggctga 
cgtttccaaa 
aaagagatcg 
cggacgcgtc 
ttgcggacgg 
gtcacgcgct 
tgcctcttcg 
gaggttaaag 
gtcgagcatc 
gtccattaat 
cgcatcttca 
ttcgcccttc 
ctggcttttg 
cggattcacc 
cgcat cacgc 
gtggttacag 
cgtggtgtag 
agactgcttt 
gttcagttcg 
cgtgacatcg 
gacccacact 
ccaacctt tc 
atctgcatcg 
taacgcgctt 



taaacacggt 
actcaacgaa 
cctttgcact 
ttgctcctat 
ccttcagctg 
atagaggaag 
atatcacatc 
ctcctcgtgg 
gcctttcctt 
acaataaagt 
tgttgaaaag 
cgtcttcttt 
cagaggcatc 
cttccttttc 
ggtttccgga 
agctgcggcc 
aaacatttat 
aaatatatta 
atcgaaaaat 
attctttgga 
aaaccccttc 
tgaaattaag 
tgctgacttc 
aggaatcaaa 
tccagggaaa 
catatataaa 
aacaattccc 
tgcgcgctat 
aaaaacccat 
attcaacaga 
actttattgc 
gctgtagccg 
ggtttttcac 
ggtttgcggt 
gaggt cgcaa 
aagacgcggt 
tacattgagt 
tgcagtttct 
tcgccgcttt 
ctgatggtat 
gggtcgagtt 
gtatccggtt 
atcagctctt 
ctgtacagtt 
ccgacagcag 
tcttcagcgt 
gcgtggtcgt 
tgacgaccaa 
actgccactg 
gctgtgacgc 
acttgcaaag 
agtt caacgc 
tcttgcgcga 
agcatt acgc 
ttcttgccgt 
ttgttcacac 
gcttcaaatg 
ttgccgtaat 
ggtataaaga 
gcgaactgat 



tcccaccaac 



aaaattcgaa 
tcataaacca 
tgatctgggg 
ccacagcttg 
gatcct ctag 
ggtcttgcga 
aatccacttg 
gtgggggtcc 
tatcgcaatg 
gacagatagc 
tctccatcga 
ttccacgatg 
ttcaacgatg 
cactatcttc 
tattaccctt 
gctctagaac 
aaatatatac 
ggaagtcagc 
atattagaag 
gctggcctat 
taaatatatt 
tttaaacatt 
ataaaatata 
aatatcgaaa 
tatattcttt 
tctaaacccc 
gatcgatcta 
attttgtttt 
ctcataaata 
aattatatga 
caaatgtttg 
acgatggtgc 
cgaagttcat 
cgcgagtgaa 
aatcggcgaa 
gatacatatc 
gcagcccggc 
cctgccaggc 
ggacatacca 
cggtgtgagc 
tacgcgttgc 
cgttggcaat 
taatcgcctg 
ctttcggctt 
cagtttcatc 
aagggtaatg 
gcaccatcag 
agccagt aaa 
accggatgcc 
acagttcata 
tcccgctagt 
tgacatcacc 
catgcgtcac 
tgcgatggat 
tttcgtcggt 
aaacggtgat 
gcgtatagcc 
gagtgaccgc 
cttcgcgctg 
cgttaaaact 
gctgatcaat 



tggacatgtg 
agattagtca 
attgccctga 
tccaccaaat 
agtcccccgt 
aggatagtgg 
ctttgaagac 
atctttggga 
atggcatttg 
tgggcaatgg 
tgatcacatc 
ctcctcgtgg 
gcctttcctt 
acaataaagt 
tgttgaaaag 
tagtggatcc 
caagtcaagt 
ttggcatgga 
ggagaggact 
cgttttaaaa 
agaaattaaa 
tataaatata 
ttaggaagtc 
aatatattag 
ggagctggcc 
ttctaaatat 
gtaacataga 
ctatcgcgta 
acgtcatgca 
taatcatcgc 
aacgatcggg 
gccaggagag 
gccagtccag 
gatccctttc 
attccatacc 
cagccatgca 
taacgtatcc 
cagaagttct 
t ccgtaataa 
gtcgcagaac 
ttccgccagt 
actccacatc 
taagtgcgct 
gttgcccgct 
aatcaccacg 
cgaggtacgg 
cacgttatcg
gtagaacggt 
gacgcgaagc 
gagataacct 
gccttgtcca 
attggccacc 
cacggtgata 
tccggcatag 
aatcaccatt 
acgtacactt 
gccctgatgc 
atcgaaacgc 
ataccagacg 
gcctggcaca 
tccacagttt 



1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 
tcgcgatcca
tctacaggac 
caaaggaaat 
tctttctgta 
gaggataat c 
agactttcag 
tactgctt at 
ttcttctcct 
gctataaaag 
aagatatttg 
gtt cctgagg 
agccagctgc 
cgtcttcaaa 
atcgagaagc 
aaaatcaccg 
cgtaccctaa 
atcaaaccga 
gaatcatcac 
acaataaatc 
agagatcgac 
tgctagtcgc 
aggggaaaag 
tctcctcctc 
ctccgttaga 
tgtgcccaat 
tgaaaaacct 
attttagata 
acctaaatgt 
gaggaggtgc 
aatcgaaaaa 
gggatggagt 
ttatataatt 
tt aaaaaaaa 
agagtttgat 
gtatattcta 
gtgtagcgac 
tgcatagttc 
tccctatatt 
atatctttaa 
aaaaatttta 
taattagtcc 
atagctccat 
agcacacttg 
ttgcaggatc 
gacgtccatg 
agctggtaaa 
tttttaagat 
aaaattttta 
acttgtaaaa 
aatcatggtc 
tacgagccgg 
taattgcgtt 
aatgaatcgg 
cgctcactga 
aggcggt aat 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcaatgctc 
gtgtgcacga 



gactgaatgc 
ggaccat gga 
gaatcataag 
atcaagcagt 
gat agataat 
ctcagaataa 
gctacagtt c 
gactaaacta 
cattatttgt 
ggaaaaaaaa 
attataactg 
gaaattttgg 
gttggattat 
atcactacca 
gtgctctgat 
accatttcat 
accgataaca 
agatacagat 
caagaacaac 
caagaactcg 
gtgtccttgg 
gggggaggaa 
t ct cctggcc 
tcatcgtttt 
atgggagatt 
aaatagttcg 
ttattaaaag 
attaaatagg 
actcacacat 
agataaataa 
aagacatgtc 
atatgtttaa 
tctctctctt 
gacaattatg 
agcaataatg 
tcttttttat 
tagttatcag 
gttagttagg 
tttctaatat 
aaacgatagg 
tctcccttct 
gccaagctga 
acttggtata 
tcgatatctt 
aaaaaaaatt 
tattaattag 
tatatagctc 
aaaagcacac 
ctcttgcagg 
atagctgttt 
aagcataaag 
gcgctcactg 
ccaacgcgcg 
ctcgctgcgc 
acggttatcc 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 
accccccgtt 



ccacaggccg 
actttgctgg 
aacaagcagt 
caaggaataa 
cgatgtgaat 
tgaaggggtg 
attt catata 
taggggtatt 
agcaatcaat 
ttctgggact 
tttccatgaa 
acaggcttaa 
ttttgttcaa 
gaatcgagga 
tttacctcaa 
ttccatagag 
tgcaacat ca 
cccctttcct 
cgaagggcac 
accggaagat 
tgctctccct 
tttatagatg 
gcaaagcctg 
ttgtgggttg 
ggcgctcgcg 
tttcatactt 
tggt ctggtg 
cgagtgatat 
gcacatgagc 
taaaaataag 
cttaatgatg 
atctttgcca 
tacccattga 
gctacctcga 
atttcatttt 
ttagcccgat 
cgctacatat 
gggtgaaaac 
atttagaagg 
ccagctccaa 
aatatatttt 
cttcctaata 
tatttataaa 
taatttctaa 
ttaaaacgat 
tcctctccct 
catgccaagc 
ttgacttggt 
atccactagt 
cctgtgtgaa 
tgtaaagcct 
cccgctttcc 
gggagaggcg 
tcggtcgttc 
acagaatcag 
aaccgtaaaa 
cacaaaaatc 
gcgtttcccc 
tacctgtccg 
tatctcagtt 
cagcccgacc 



t cgagttttt 
tgaaagtggc 
ataactatcc 
ccaaaaagaa 
ttgaaaacaa 
agagct aata 
attaaaaat g 
acctattaac 
tcattgaact 
aaagcaaatc 
tttcgtgatg 
gaat caatca 
tagggaataa 
ccaaacaacc 
acaaaaggta 
ctctccagac 
agaacccct c 
aacatcaacg 
ataccctgtg 
atggatcgaa 
cttcttggct 
ggaggaggag 
ctgttaaaag 
ttctgtgggc 
cacgagcaaa 
ttcatgctat 
atggtggagt 
tagagatgt g 
acaaagtgtg 
tcaattgtct 
tgcgatatga 
atacgttcag 
gttcagctag 
ataatttaac 
cccacttaat 
tgccactaaa 
ataaggtct c 
ggtgcagaaa 
ggtttagatt 
agaatatatt 
tcgatatttt 
tattttatga 
tgtttaaact 
tatatttaga 
aggccagctc 
tctaatatat 
tgacttccta 
atatatttat 
tctagagcgg 
att gttatcc 
ggggtgccta 
agtcgggaaa 
gtttgcgtat 
ggctgcggcg 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 
gctgcgcctt 



t gattt cacg 
aagctacacc 
cgattacagc 
acttctacag 
aattgaggac 
aaaaaaataa 
gagatcagaa 
tgaactacag 
gaaccagaaa 
ggaagaacag 
gggactactg 
tt caatctca 
acaaaggata 
gtacttttat 
ctcacaaaat 
gattgaaaac 
t atcccaaat 
cccgtactac 
aggaggaggt 
gaaggcggtc 
tggatgcctc 
gatgcttggt 
gttgttgcgg 
agccact ctg 
attctttttt 
tctgtaaaat 
gcttgaattc 
tt gctagtt g 
t gcgt aagt a 
ttaattacta 
cgaatgcaca 
aatatcttca 
aaagattttt 
ttcatgacag 
tactaaggaa 
gtaggtggat 
attttatatt 
ctgcagcccg 
tatatatgac 
tccctggagc 
tgattccttt 
agtcagcaaa 
taatttcact 
aggggtttag 
caaagaatat 
ttttcgatat 
atatatttta 
aaatgtttaa 
ccgcgcgccg 
gctcacaat t 
atgagtgagc 
cctgtcgtgc 
tgggcgctct 
agcggtatca 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 
cgttcgctcc 
atccggtaac 



ggttggggtt 
aagaactgca 
ttcataaatt 
gtagatacaa 
agttcatgcc 
tccaaatgaa 
atcggataag 
ctaggataac 
gtgacttttt 
ggaatcccct 
tatctaaaca 
t caacgggac 
gcttcgtcaa 
tacaccaagc 
cgcaagattc 
catactacta 
ccaagaacag 
acaacctaga 
ggaggaggga 
gctcggcttc 
ctatatagag 
tttctttgcc 
acttgctgtc 
ttgttgcctg 
tttttgtatg 
tattttttgt 
agatattgct 
aacgtgtatt 
ccttgcatct 
aataaaaatt 
aactataaaa 
ttctattttt 
ttattcaaga 
agatgtgaac 
aacgaaacta 
gagtttttct 
tagcacacgt 
ggggatctcg 
gtccatgaaa 
tggtaaatat 
ttaagattat 
atttttaaaa 
tgtaaaactc 
atttatatat 
atttccctgg 
ttttgattcc 
tgaagtcagc 
acttaatttc 
cagctggcgt 
ccacacaaca 
taactcacat 
cagctgcatt 
tccgcttcct 
gctcactcaa 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 
aagctgggct 
tatcgtcttg 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 9180

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 9240

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 9300

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 9360

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 9420

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 9480

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 9540

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 9600

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 9660

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 9720

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 9780

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 9840

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt 9900

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 9960

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 10020

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 10080

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 10140

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 10200

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 10260

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 10320

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 10380

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 10440

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 10500

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 10560

acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc 10620

cctttcgtc 10629



<210> 29 
<211> 676 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 5' MAR in 
Ar-Act-Af 



<400> 29 

ctccaccgcg gtggcggccg ctctagaact agtggatcct gcaagagttt tacaagtgaa 60

attaagttta aacatttata aatatatacc aagtcaagtg tgctttttaa aaattttgct 120

gacttcataa aatatattag gaagtcagct tggcatggag ctatataatc ttaaaaagga 180

atcaaaaata tcgaaaaata tattagaagg gagaggacta attaatattt accagctcca 240

gggaaatata ttctttggag ctggcctatc gttttaaaat tttttttcat ggacgtcata 300

tataaatcta aaccccttct aaatatatta gaaattaaag atatcgagat cctgcaagag 360

ttttacaagt gaaattaagt ttaaacattt ataaatatat accaagtcaa gtgtgctttt 420

taaaaatttt gctgacttca taaaatatat taggaagtca gcttggcatg gagctatata 480

atcttaaaaa ggaatcaaaa atatcgaaaa atatattaga agggagagga ctaattaata 540

tttaccagct ccagggaaat atattctttg gagctggcct atcgttttaa aatttttttt 600

catggacgtc atatataaat ctaaacccct tctaaatata ttagaaatta aagatatcga 660

gatcccccgg gctgca 676



<210> 30 
<211> 15676 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: pAct2-bin



<400> 30 

gatctgcaag cgatcgcggc gcgccattta aatgcccggg cgtttaaacg cggccgcatt 60 
cccgggaagc ttgcatgcct gcagaattcg gcttgttttg acgagttcgg atgtagtagt 120 
agccattatt
attgtataaa 
tatgatacaa 
aacaagccaa 
taaaaaaacg 
aaagcaaaag 
cgagggaaaa 
cgtgtctgtc 
cccgcctata 
acttt cat ca 
aagagagaaa 
tcatcttctt 
ctggatctcg 
ctgtgaacct 
tagctcgatt 
tacctgggaa 
ct gaacactg 
tatagatt ct 
gattcaatca 
catggtccgt 
attcagtctg 
agaaagccgg 
tcgtaattat 
aggccagcgt 
taatcaggaa 
gtatgttatt 
gcagactat c 
cttccatgat 
gaacacctgg 
gtctgttgac 
tcaacaggtg 
cctctggcaa 
agagtgtgat 
gttcctgatt 
cttacgtggc 
gattggggcc 
ggcagatgaa 
tttaggcatt 
caacggggaa 
aaaccaccca 
gcacgggaat 
cacctgcgt c 
tgtgctgtgc 
agagaaggt a 
catcaccgaa 
gagtgaagag 
cgccgtcgt c 
gcgcgttggc 
ttttctgctg 
caaacaatga 
ttggagagct 
aatcctgttg 
gtaataatta 
ccgcaattat 
ttatcgcgcg 
cttaattaag 
gcaccggtga 
gctttctaat 
tatataccgt 
ttctgttgga 
aactgccgag 



taatgtacat 
tatccataaa 
rtctaataga 
ccacgacgac 
gagctgt cat 
aactaatcca 
ggctgcct ga 
gattttaatt 
taaattcata 
gccgttttga 
gtaagagata 
ccgctctttc 
atcttgtttt 
ccactaaatc 
atagctacca 
atgatttgta 
r caatgttag 
tcgaaacttt 
gggtttattt 
cctgtagaaa 
gatcgcgaaa 
gcaattgctg 
gcgggcaacg 
atcgtgctgc 
gtgatggagc 
gccgggaaaa 
ccgccgggaa 
ttctttaact 
gtggacgata 
tggcaggtgg 
gttgcaactg 
ccgggtgaag 
atctacccgc 
aaccacaaac 
aaaggattcg 
aactcctacc 
catggcatcg 
ggttt cgaag 
act cagcaag 
agcgtggtga 
atttcgccac 
aatgtaatgt 
ctgaaccgtt 
ctggaaaaag 
tacggcgtgg 
tatcagtgtg 
ggtgaacagg 
ggtaacaaga 
caaaaacgct 
atcaacaact 
cgaatttccc 
ccggtcttgc 
acatgtaatg 
acatttaata 
cggtgtcatc 
gccggcctgc 
gtaatattgt 
ttcaaactat 
tgtaatttga 
gtgcagccca 
gggaagccat 



actaatcgtg 
cacatcatga 
aaacgaatta 
gactaacgtt 
gtaacacgcg 
aggggctgag 
cagccaggtc 
atttttttga 
tattttcctc 
atctccggcg 
atccaggaga 
tttccaaggt 
ctcaatttcc 
ttttggtttt 
gaatttggct 
tatgtgaaat 
attgaatctg 
aggatttgta 
gactgtattg 
ccccaacccg 
actgtggaat 
tgccaggcag 
tctggtatca 
gtttcgatgc 
atcagggcgg 
gtgtacgtat 
tggtgattac 
atgccggaat 
tcaccgtggt 
tggccaatgg 
gacaaggcac 
gttatctcta 
ttcgcgtcgg 
cgttctactt 
ataacgtgct 
gtacctcgca 
tggtgattga 
cgggcaacaa 
cgcacttaca 
tgtggagtat 
tggcggaagc 
tctgcgacgc 
attacggatg 
aacttctggc 
atacgttagc 
catggctgga 
tatggaattt 
aagggatctt 
ggactggcat 
ctcctggcgc 
cgatcgttca 
gatgattatc 
catgacgtta 
cgcgatagaa 
tatgttacta 
aggcatgcaa 
acggctaaga 
tcgggcctaa 
gctcgtgtga
tttcaccgga 
cttgagcgcg 



aatagtgaat 
aagacacttt 
aattacgttg 
gcctggattg 
gat cgagcag 
atgattaatt 
acgttat ctt 
aaggccgaaa 
tccgctttga 
act tgacaga 
ttcat tctcc 
aataggaact 
ttgagatctg 
actagaatcg 
tgaccttgat 
gaaatctgaa 
aacactgttt 
gtgtcgtacg 
aactcttttt 
tgaaatcaaa 
tgat cagcgt 
ttttaacgat 
gcgcgaagtc 
ggtcactcat 
ctatacgcca 
caccgtttgt 
cgacgaaaac 
ccatcgcagc 
gacgcatgtc 
tgatgtcagc 
tagcgggact 
tgaactgtgc 
cat ccggtca 
tactggcttt 
gatggtgcac 
ttacccttac 
tgaaactgct 
gccgaaagaa 
ggcgattaaa 
tgccaacgaa 
aacgcgtaaa 
t cacaccgat 
gtatgtccaa 
ctggcaggag 
cgggctgcac 
tatgtatcac 
cgccgatttt 
cactcgcgac 
gaacttcggt 
accatcgtcg 
aacatttggc 
atataatttc 
tttatgagat 
aacaaaatat 
gatcgatcgg 
gctcggatct 
gcgaatttgg 
cttttggtgt 
ataagtcgct 
caagtcggct 
gaatgggaat 



at gatgaaac 
cttt cacggt 
aattgtatga 
actcggttta 
gt cacagt ca 
agtttaaaaa 
tacct gtggt 
ataaagttgt 
attgtctcgt 
gaagaacaag 
gttttgaat c 
ttctggatct 
gaattcgttt 
atctaagttg 
ggagagatcc 
ctgttgaagt 
aaggttagat 
ttgaacagaa 
gtgtgtttgc 
aaactcgacg 
tggtgggaaa 
cagttcgccg 
tttataccga 
tacggcaaag 
tttgaagccg 
gtgaacaacg 
ggcaagaaaa 
gtaatgctct 
gcgcaagact 
gttgaactgc 
ttgcaagtgg 
gtcacagcca 
gtggcagtga 
ggtcgtcatg 
gaccacgcat 
gctgaagaga 
gctgtcggct 
ctgtacagcg 
gagctgatag 
ccggataccc 
ctcgacccga 
accatcagcg 
agcggcgatt 
aaactgcatc 
tcaatgtaca 
cgcgtctttg 
gcgacctcgc 
cgcaaaccga 
gaaaaaccgc 
gctacagcct 
aataaagttt 
tgttgaatta 
gggtttttat 
agcgcgcaaa 
gaattgttta 
cacctacgat 
cctgtagacc 
gatgatgctg 
gtgtatgttt 
agattgattt 
ggatttcgtt 



attgtatctt 
ct gaattaat 
aatctaattg 
agttaaccac 
tgaagccatc 
ttagttaaca 
cgaaatgatt 
aagagataaa 
tgtcctcctc 
gaagaagact 
tt cctcaatc 
actttatttg 
aatttgggat 
accgatcagt 
atgttcatgt 
tagattgaat 
gaagtttgtg 
agctatttct 
agctgtagac 
gcctgtgggc 
gcgcgttaca 
at gcagatat 
aaggttgggc 
tgtgggtcaa 
atgtcacgcc 
aactgaactg 
agcagtctta 
acaccacgcc 
gtaaccacgc 
gtgatgcgga 
tgaatccgca 
aaagccagac 
agggcgaaca 
aagatgcgga 
taatggactg 
tgctcgactg 
ttaacctctc 
aagaggcagt 
cgcgtgacaa 
gtccgcaagt 
cgcgtccgat 
atctctttga 
tggaaacggc 
agccgattat 
ccgacatgtg 
atcgcgtcag 
aaggcatatt 
agtcggcggc 
agcagggagg 
cggtggggaa 
cttaagattg 
cgttaagcat 
gattagagtc 
ctaggataaa 
aacgcggccg 
ggggggcatc 
tcaattgcga 
actggcagga 
gtttgattgt 
agccctgatg 
gtacaacgag 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
acgacagaac
attgtgccag 
ggaaacgtat 
tatcaaaaaa 
tttttctgat 
tcctaagtgt 
tgaatgtaag 
aaaagaatta 
gtcctacaat 
aaaataaaca 
acaaatcaca 
aagt gagt ga 
agt gcaaaaa 
atcataaaaa 
cttagaagca 
aattgaagtt 
aaatataaaa 
acaattgaat 
aacgacgaaa 
attcaactta 
tattctacag 
ccatttaagc 
tctgattgtt 
gctcttgcac 
tcctaaacca 
agataaatat 
tcaactgttt 
aagtaccgtt 
gaggaaataa 
cccccggaac 
gagaacattg 
cgagcagaag 
gttgccactt 
ctgcgacgcc 
cagttccgca 
cagagatgaa 
caggcggtag 
gatgaagatg 
taaacccgcc 
ctccacgaaa 
gaagaccgac 
caaccgttca 
catctctgga 
cgctccaagc 
cggcagttcg 
cagtgcccgc 
ctgaccccac 
ccaccaggcc 
tcacgcgggt 
gctcccggtg 
acttctccca 
cgatcaggac 
gcagcgacac 
gtaggcgcga 
ggt cctggca 
cgtactccaa 
tgtaggtgat 
gcgggatttt 
ctcgcatcgt 
gctgcttcgt 
cctcgccggc 



acccacggga 
gcagt tgaaa 
ttaaaaaaaa 
ttcattgttc 
t accaggaac 
ggcaacaat a 
tt t actttag 
ctagagggaa 
gccgttgtaa 
t aaaagccct 
agtgatttgt 
t tgt gggaaa 
aagatataat 
tcgaaacagc 
aacttaagag 
aaattagatg 
t attctcaaa 
ttaaaagaaa 
ctggctaaaa 
tcgtcagaaa 
tttcaattcc 
acacaaatta 
gaaaaaaaaa 
actcaagtct 
aaagtaaaca 
tggaagctat 
actaaaaat c 
acttatgagc 
gggctgagaa 
caaaggaagt 
gttccttgta 
tcct ccggcc 
gcgggtcagc
gacaggat ct 
aatagccccc 
cacgaccatc 
accgaaataa 
cgcatccacc 
ggcaacgccc 
acgccggaca 
agcccaatga 
gcgaacgcct 
gctttcttca 
cgtcgaatct 
tagagcgcgc 
ttgttcctga 
aaggccctag 
gctgcctcgc 
ggaatccgat 
cgagctgaaa 
tatgaatttc 
ctggcaacgg 
cgattccagg 
caggcattcc 
aagctcgt ag 
cacctgctgc 
cttcacgtcc 
cttgttgcgc 
gt ccggccac 
gtgtttcagc 
ggtttttcgc 



ccgagcttcg
gtcagcacct 
tcggtaaaag 
atttactgga 
agaaagaaaa 
ttgtatt aaa 
ctgaacattt 
aagaaaccct 
aaatcggaga 
ttaaagaggg 
gattgttgat 
tttaggcgca 
gggagataag 
aaagaatggc 
tgtgttgata 
ctaaaaattt 
actttttaac 
ccgataccgt 
taagtaaaca 
aattaaaact 
ctaacaaaca 
ttaaaaaagt 
aaaaaaagcg 
cgatt cagca 
gtgtcttaat 
atacgtactt 
agtttcatca 
aagtattgtc 
agcccagtaa 
aggttaaacc 
ggcatcggga 
gccagttgcc 
acggttccga 
agcgctgcgt 
aggaccgcca 
agcggctgca 
acaacaagct 
agatt cccgt 
gcagcagcat 
gatgcgcctt 
tctcgccgtc 
ccatgggctt 
gggccgacaa 
gagccttaat 
cgtgcgcccg 
aatgccagta 
cgtttgcaat 
aactcttcgc 
ccgcacatga 
tagt cgaaca 
gtgtagtggt 
gacgttttct 
tgcccaacgc 
tcggccttcg 
aacgtgaagg 
cacaccagtt 
ttgttgacgt 
gtggtgaaca 
ggcgcaatat 
aacgcggcct 
ttcttggtcg 



caagcttgtt 
tttaacgagt 
agaaggcaat 
agctgtacct 
agaagcagaa 
aaaatcgttt 
tagaaagaaa 
ggtaaaagga 
aaagggatat 
cttttatata 
gataaaataa 
caaaaagaaa 
acggttcgtg 
ggaaacgtaa 
gtgcagtatc 
gtaattaaga 
gaatgaaaaa 
ttacgaaatt 
ggtaacgtct 
gaatactcgt 
gaggt ataaa 
ggtttttgaa 
taccttggat 
attgcttaag 
aaaacttacc 
tgtttcaaaa 
agcaatgaaa 
tatttttaat 
ggaaacaact 
cgctccgatc 
ttggcggatc 
aggccgtaaa 
acgccatgga 
ttggtgtcaa 
tcaatcgtat 
cagcgcctac 
ccagaatagc 
tggaatctgt 
accggcgacc 
gtgagcgtcc 
gatgtaggcg 
tttctcctcg 
tcggatctcg 
cacaattgtc 
agcgatactg 
aagcgctggc 
gcaccaggtc 
aggcttcgcc 
ggcggaaggt 
tccgtcgggc 
cgccagcaaa 
tgccacggtc 
ggtcggacgt 
tgtaataccg 
tgatcggct c 
cgtcatcgt c 
ggaaaatgac 
gggcagagcg 
cgaacaagga 
gcttggcctc 
tcatagttcc 



gt aact gaaa 
gctgaaat ga 
caagagaact 
aacgatatag 
aaaagtatt g 
tatggttgtt 
aaactcacca
atcaaaacga 
attgatttta 
ttaatcacaa 
gaataagaag 
aacgaaatga 
ttcgtgctga 
aagaagttat 
ttaaaatttt 
aggagtgatt 
gtactcaacc
ggaacaggta 
attgaattag 
gtcactttaa 
attgttggga 
agccatgcgt
attcaccgaa 
ctgccagcgg 
cgccatacca 
tgggtcaatc 
cacgccaaag 
agttatctat 
gtaggttcga 
aggccgagcc 
aaacactaaa 
ggtgagcaga 
aaccgccccc 
caccaacagc 
cgggctacct 
cgtcgccgcg 
gaaatattaa 
cggacgatca 
cctcggcctc 
ttggggccgt 
ccgaatgcca 
tgctcgtaaa 
cggaaatcct 
aattttaatc 
agcgaagcaa 
tgctgaaccc 
atcattgacc 
gacctgctcg 
ttccagcttg 
cgtcggcgac 
cagcacgacg 
caggacgcgg 
gaagcccatc 
gccattgatc 
gccgataggg 
ggcccgcagc 
cttgttttgc 
ggccgtgtcg 
aagctgcatt 
gctgacctgt 
tcgcgtgtcg 



aaggaaaatt 
cggctaaatg
ttattacgaa 
aaaaactaaa 
taggaaaatg 
caaattatcc 
aaacaaatgt 
aagatagaaa 
tatctttctc 
atcacttatc 
aaatagaaag 
tacaccaatc
cttgcaccat 
ggaaataaga 
gtataatagg 
acatgaacaa 
aaataataaa
aagggcattt 
acagtcatct 
ttcaccaaga 
gtattcctta 
ctgacatcta 
cactagggtt 
aatgctttca 
cagatgttcc 
gagaatatcg 
taaacaattt 
tatttaacgg 
gtcgcgagat 
acgccaggcc 
gctactggaa 
ggcacgggag 
gccaggcccg 
gccacgcccg 
agcagagcgg 
acccgcccgg 
gtgcgccgag 
tcacgagcaa 
gctgttcggg
cctcctgttt 
cggcatctcg 
cggacccgaa 
gcacgtcggc 
ctctgtttat
gtgcgtcgag 
ccagccggaa 
caggcgtgtt 
cgccacttct 
agcgggtacg 
agcttgcggt 
atttcctcgt 
aagcggtgca 
gccgtcgcct 
gaccagccca 
gtgcgcttcg 
tcgacgccgg
agcgcctcgc 
tttggcatcg 
tccttgatct 
tttgccaggt 
atggtcatcg 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
acttcgccaa
gcgcgggcag 
gctggaccat
ttgcgatggt 
ccttccggtc
gggcaatgtg
ccttatcggc 
tccgaatctt 
cggcctgaga 
catcgttgcg 
tgccatgacg 
nnctcgaaat 
agtcgctctt 
cgttttagcg 
ttgccaagct 
ccgaaccgcg 
cccaggtcgc 
attttgtagc 
gccttttcct 
cccttcctgg 
gtagccggcc 
acagtgaaga 
cgccgttgga 
gcgaaaaagg 
gcggaaaaga 
ctggcggccg 
caccgcggcc 
atgaggggcg 
ggggcaggct 
cgcctgattt 
gcggtattga 
tgaggggcag 
aggcagaaaa 
cttttaacct 
ggcgcgtgac 
taacgcgggc 
aaaaatggca 
cgttcaaata 
tcgactgctt 
ctccgttgcc 
gaaattatct 
tgcaaagcgg 
ctacatggat 
attttttttc 
ttgataaaac 
tctaattcgg 
tcgacctgca 
tcgccccatc 
tggaccagtt 
gatgcgtgat 
tgtctcaaaa 
aactgtctgc 
cgtcttgctc 
gggctcgcga 
atgcgccaga 
agatggtcag 
tccgtactcc 
aggtattaga 
tgcgccggtt 
gtctcgctca 
acgagcgtaa 



acctgccgcc 
ggcaggggga 
cgagccgacg 
ttcggcatcc
aaacgtccga 
cccttattcc 
aatgaagtcg 
gccctgcacg 
gccaaaacac 
ccactcttca
tacctcggtg 
tcgctcggtc
cttgatggag 
gctaaaaaag 
cgtcctgctt
ccgtgcgcgg 
cattgatgcg 
cctggccgac 
caatcgctct 
ttggcttggt 
agcctcgcag 
aggaacaccc 
tacaccaagg 
atggatatac 
tccgtcgacc 
tctatggccc 
ggccgccggc 
gacgttgaca 
cgatttcggc 
tacgcgagtt 
cacttgaggg 
agtgatgaca 
tccagcattt 
gcttttaaac 
cgcgcacgcc 
ctcccatccc 
ggccaagctt 
ctttgcgatc 
ctctcgcaac 
ccgtcgctca 
gcctaaccgg 
cagcggcggc 
cagcaatgag 
aattcaaaaa
gacaaattac 
aaatctttat 
gtgaggtctg 
atccagccag 
ggtgattttg 
ctgatccttc 
tctctgatgt 
ttacataaac 
gaggccgcga 
taatgtcggg 
gttgtttctg 
actaaactgg 
tgatgatgca 
agaatatcct 
gcattcgatt 
ggcgcaatca 
tggctggcct 



tcctgttcga 
gccagttgca 
gactggaagg 
tcggcggaaa 
ttcattcacc 
tgatttgacc 
gtcccgtaga
aataccagcg 
ttgatgcgga 
ttaaccgcta 
tcacgggtaa
ttgccttgct 
cgcatgggga 
tcatggctct 
ctcttcgatc 
gtcgtcggtg 
ggccagctcg
ggccagcagg 
tcgttcgtct 
ttcatcagcc 
agcaggattc 
gctcgcgggt
aaagtctaca 
cgaaaaaatc 
ctttccgacg 
tgcaaacgcg 
gttgtggata 
cttgaggggc 
cggcgacgtg 
tcccacagat 
gcgcgactac 
gatgaggggc 
gcaagggttt 
caatatttat 
gaaggggggt 
cccaggggct 
gcttggtcgt
gtgttgcgcg 
gccatccgac
ccgtgttggg 
ctcagttctg 
aggatatatt 
tatgatggtc 
tgtagatgtc
gatccgtcgt 
ttcgacgtgt 
cctcgtgaag
aaagtgaggg 
aacttttgct
aactcagcaa 
tacattgcac 
agtaatacaa 
ttaaattcca 
caatcaggtg
aaacatggca 
ctgacggaat
tggttactca 
gattcaggtg 
cctgtttgta 
cgaatgaata 
gttgaacaag 



gacgacgcga 
cgctgtcgcg 
tttcgcgggg 
accccgcgtc 
ctccttgcgg
cqcctggtgc 
ccgtctggcc
accccttgcc 
agaagtcggt 
tatcgaaaat
gattaccgat 
cgtcggtgat
cgtgcttggc 
gccctcgggc 
ttcgccagca 
agccagagtt 
cggacgtgct
taggccgaca 
ggaaggcagt 
atccgcttgc 
ccgttgagca 
gggcctactt
cgaacccttt 
gctataatga 
ctcaccgggc 
ccagaaacgc 
ccacgcggaa 
cgactcaccc 
gagctggcca 
gatgtggaca 
tgacagatga 
gcacctattg
ccgcccgttt
aaaccttgtt 
gccccccctt 
gcgcccctcg 
tccgcgtgaa
cctgcccggt 
ggatgatgtt 
gggaaggtgc 
cgtagaaacc 
caattgtaaa 
aatatggaga 
cgcagcgtta
atttataggc 
ctacattcac 
aaggtgttgc 
agccacggtt
ttgccacgga
aagttcgatt 
aagataaaaa 
ggggtgttat 
acatggatgc 
cgacaatcta
aaggtagcgt 
ttatgcctct 
ccactgcgat 
aaaatattgt 
attgtccttt 
acggtttggt 
tctggaaaga 



acgctccacg 
ctcgatcttg 
cgcacgcatg 
gatcagttct 
gattgccccg
cttggtgtcc
gtccttctcg
caaatacttg 
gcgctcctgc 
tgcttgcggc 
aaactggaac
gtacttcacc
aatcacgcgc 
ggaccacgcc 
gggcgaggat
tcagcaggcc
catagtccac
ggctcatgcc
acaccttgat
cctcatctgt
ccgccaggtg
cacctatcct
ggcaaaatcc
ccccgaagca
tggttgccct 
cgtcgaagcc 
aacttggccc
ggcgcggcgt 
gcctcgcaaa 
agcctgggga 
ggggcgcgat 
acatttgagg 
ttcggccacc
tttaaccagg
ctcgaaccct 
gccgcgaacg 
cgtcggctcg
gcgtcggctg
taaaagtccc
acatggctca 
aacatgcaag 
tggcttcatg
aaaagaaaga 
ttataaaatg 
gaaagcaata 
gtccaaatgg 
tgactcatac
gatgagagct 
acggtctgcg
tattcaacaa 
tatatcatca 
gagccatatt 
tgatttatat 
tcgattgtat 
tgccaatgat 
tccgaccatc
ccctgggaaa 
tgatgcgctg 
taacagcgat
tgatgcgagt 
aatgcataaa 



gcggccgatg
gccgtagctt 
acggtgcggc
tgcctgtatg 
actcacgccg
agataatcca 
tacttggtat
ccgtgggcct 
ttgtcgccgg 
ttgttagaat 
tgattatggc 
agctccgcga
accccccggc 
catcatgacc 
cgtggcatca
gcccaggcgg 
gacgcccgtg
ggccgccgcc 
aggtgggctg 
tacgccggcg
cgaataaggg
gcccggctga
tgtatatcgt 
gggttatgca
cgccgctggg 
gtgtgcgaga 
tcactgacag
tgacagatga 
tcggcgaaaa
taagtgccct 
ccttgacact 
ggctgtccac
gctaacctgt 
gctgcgccct
cccggcccgc 
gcctcacccc 
attgtacctg 
atctcacgga 
atgtggatca
gttctcaatg 
ctccaccggg 
tccgggaaat
gtaattacca 
aaagtacatt
aacaaattat 
gggagatccg 
caggcctgaa 
ttgttgtagg 
ttgtcgggaa
agccacgttg 
tgaacaataa 
caacgggaaa 
gggtataaat 
gggaagcccg
gttacagatg
aagcatttta 
acagcattcc 
gcagtgttcc 
cgcgtatttc
gattttgatg 
cttttgccat 



7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 
tctcaccgga
aggggaaatt 
atcttgccag 
ttcaaaaata 
atgagttttt 
cttgacggga 
cacgcatctt 
ctggtccacc 
tggggcgatt 
cctgcaggtc 
tgccccggtt 
atgatccaac
gcttttgtcc 
gtctcagggt 
agattttcac 
tgagatatca 
gtcaataatc
aatactgcac 
agatcaagcg 
cgcgctccac 
ttgagcatgg 
ccgcgcccag 
aactccagca 
ccgaagccca 
ggcgtcgctt 
ggcgatagaa 
ggtcagccca 
gatagcggtc 
ccaccatgat 
gcatgcgcgc 
ccagatcatc 
gtttcgcttg 
catcagccat
ccggcacttc 
ctgcgcaagg 
cattcagggc 
gccggaacac 
gcctctccac 
aggatctcga 
gccttttgat 
aaccttattg 
ttcaacatca 
cgaatctcct 
cagaaaatga 
tgtctctttg 
gtccccagat 
cttacgcagc 
accttcccaa 
cagagaaaga 
tgcttcacaa 
aatcaaaggc 
ctggcgaaca 
acatggtgga 
accaaagggc 
attgcccagc 
aatgccatca 
ccaaagatgg 
cttcaaagca 
actatccttc 
gggggactct 
gagttgtccc 



ttcagtcgtc 
aataggttgt 
cctatggaac 
tggtattgat 
ctaatcagaa
cggcggcttt 
cccgacaacg 
tacaacaaag 
caggcctggt 
gacggatctg
tcgttgcgac 
ccgaggatat 
ccaagaatga 
taatgtctat 
tgtgcgacgc 
tcaatattat 
tgatttgaat 
tgcaaatgaa 
gagtgagggc 
caatgccctc 
tctcagcgcc 
tccggtgtaa 
tgagatcccc 
acctttcata 
ggtcggtcat 
ggcgatgcgc 
ttcgccgcca
cgccacaccc 
attcggcaag 
cttgagcctg 
ctgatcgaca 
gtggtcgaat 
gatggatact 
gcccaatagc 
aacgcccgtc 
accggacagg 
ggcggcatca 
ccaagcggcc 
tccccgggct 
atcagccaca 
aattctcttg 
aatgaatagt 
ttgtaattga 
acaggagtta 
tcattgctgt 
tagccttttc 
aggtctcatc 
gaaggttaaa 
tatatttctc 
accaaggcaa 
catggagtca 
gttcatacag 
gcacgacacg 
aattgagact 
tatctgtcac 
ttgcgataaa 
acccccaccc 
agtggattga 
gcaagaccct 
agaggatccg 
aattcttgtt 



actcatggtg 
attgatgttg 
tgcctcggtg 
aatcctgata 
ttggttaatt 
gttgaataaa 
cagaccgttc 
ctctcatcaa 
atgagtcagc 
ggggatctag 
taacatgagt 
agcaaagctc 
ggtgctatgc 
gtatcttaaa 
aaaaatatta 
aataaaaata 
atttattaat 
aattaacaca 
ctcatatccg 
gacatagatg 
ggctttaagc 
gaacgggtct 
gcgctggagg 
gaaggcggcg 
ttcgaacccc 
tgcgaatcgg 
agctcttcag
agccggccac 
caggcatcgc 
gcgaacagtt 
agaccggctt 
gggcaggtag 
ttctcggcag 
agccagtccc 
gtggccagcc 
tcggtcttga 
gagcagccga 
ggagaacctg 
gattttctca 
agtgtgtggg 
aaaggaagtc 
ggctaagcca 
gattaacgaa 
gataaatact 
ggtaattctt 
aatttcagaa 
aagacgatct 
gatgcagtca 
aagatcagaa 
gtaatagaga 
aagattcaaa 
agtctcttac 
cttgtctact 
tttcaacaaa 
tttattgtga 
ggaaaggcca 
acgaggagca 
tgtgatatct 
tcctctatat
gatccgtcga 
gaattagatg 



atttctcact
gacgagtcgg
agtgttctcc 
tgaataaatt 
ggttgtaaca
tcgaactttt 
cgtggcaaag 
ccgtggctcc 
aacaccttct 
cagatccgcg
tcttggacaa 
gttcgtgcag 
atgaaggaat 
taatgttgtc 
aataaatatt 
tccattaaac 
tgtaacgaat 
tactaataaa 
gtctcagtta 
ccgggctcga 
tcaatcccat 
gtccatccac
atcatccagc 
gtggaatcga 
agagtcccgc 
gagcggcgat 
caatatcacg 
agtcgatgaa 
catgggtcac 
cggctggcgc 
ccatccgagt 
ccggatcaag 
gagcaaggtg 
ttcccgcttc 
acgatagccg 
caaaaagaac 
ttgtctgttg 
cgtgcaatcc 
gtctccagag 
aatcttatct 
cgcaaagtgg 
tgcttgccat 
actcttgaaa 
aaatttcttt 
ttctgcagct 
agaatgctaa 
acccgagcaa 
aaagattcag 
gtactattcc 
ttggagtctc 
tagaggacct 
gactcaatga 
ccaaaaatat 
gggtaatatc 
agatagtgga 
tcgttgaaga 
tcgtggaaaa 
ccactgacgt 
aaggaagttc
ccatgagcaa 
gtgatgttaa 
tgataacctt
aatcgcagac 
ttcattacag 
gcagtttcat 
ctggcagagc 
gctgagttga 
caaaagttca 
gtcactttct
tcacgaggca 
aggggatcga
atttgattgg 
caatggaacg 
ctacccgttg 
ggtattttgt 
attattatct 
acgatttgat 
tacataaaga 
tgcgtcaaat 
caagcacggt 
cgctgaggac 
cccaatctga 
ctctgttggg 
cggcgtcccg 
aatctcgtga 
tcagaagaac 
accgtaaagc 
ggtagccaac 
tccagaaaag 
gacgagatcc 
gagcccctga 
acgtgctcgc 
cgtatgcagc 
agatgacagg 
agtgacaacg 
cgctgcctcg 
cgggcgcccc 
tgcccagtca 
atcttgttca 
atgtgtttaa 
tcggatttca 
ttgtcggttc 
ctgatgtttc 
tgagtattat 
atagtattta 
ttaaagctca 
cccacagatg 
taatctccag 
gactaactgc 
agtatggacg 
taaaaaggta 
aacagaactc 
caagaagaaa 
caaagataca 
cggaaacctc 
aaaggaaggt 
tgcctctgcc 
agaagacgtt 
aagggatgac 
atttcatttg 
aggagaagaa 
tgggcacaaa 



atttttgacg 
cgataccagg 
aaacggcttt 
ttgatgctcg 
attacgctga
aggatcagat 
aaatcaccaa 
ggctggatga 
gacctcagcg 
gcccgacata 
acctgatgag 
gccaaaccgt 
atgtccaaca 
aatctcatat 
acgtvttgat 
acaaatgaca 
tcgaatagaa 
atctttgcca 
atccccgaag 
attgcctacc 
atatcctatc 
gtgggcgaag 
gaaaacgatt 
tggcaggttg 
tcgtcaagaa 
acgaggaagc 
gctatgtcct 
cggccatttt 
tcgccgtcgg 
tgctcttcgt 
tcgatgcgat 
cgccgcattg 
agatcctgcc
tcgagcacag 
tcctgcagtt 
tgcgctgaca 
tagccgaata 
atcatgctaa 
ataggcagta 
attaggaatt 
ctttaatgtg 
ttccaagttt 
ctgtcctaat 
ttaccgccaa 
gatcttgcag 
gttagagagg 
gagatcaaat 
atcaagaaca 
attcaaggct 
gttcccactg 
gccgtaaaga 
atcttcgtca 
gtctcagaag 
ctcggattcc
ggctcctaca 
gacagtggtc 
ccaaccacgt 
gcacaatccc 
gagagaacac 
cttttcactg 
ttttctgtca 



11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
gtggagaggg
ctggaaaact
gcttttcccg 
agggatacgt 
ctgaagtcaa
tcaaggagga
tatacatcac
acattgaaga 
atggccctgt 
atcccaacga 
cacatggcat 
tttggcaata 
aatttctgtt 
tgagatgggt 
aaatatagcg 
gatcgggaat 



rgaaggtgat 
acctgttcca 
ttatccggat 
gcaggagagg 
gtttgaggga
cggaaacatc
ggcagacaaa 
tggatccgtt 
ccttttacca 
aaagcgtgac 
ggatgagcta 
aagtttctta
gaattacgtt 
ttttatgatt 
cgcaaactag 
tagatc 



gctacatacg
tggccaacac
catatgaagc 
accatctctt 
gacaccctcg 
ctcggccaca 
caaaagaatg 
caactagcag 
gacaaccatt 
cacatggtcc 
tacaaataaa 
agattgaatc 
aagcatgtaa 
agagtcccgc 
gataaattat 



gaaagcttac 
ttgtcactac
ggcacgactt 
tcaaggacga 
tcaacaggat 
agttggaata 
gaatcaaagc
accattatca 
acctgtcgac 
ttcttgagtt 
gatcctcgaa 
ctgttgccgg
taattaacat 
aattatacat 
cgcgcgcggt 



ccttaaattt 
tttctcttat 
cttcaagagc 
cgggaactac 
cgagcttaag 
caactacaac 
taacttcaaa
acaaaatact 
acaatctgcc 
tgtaactgct 
tttccccgat 
tcttgcgatg 
gtaatgcatg 
ttaatacgcg 
gtcatctatg 



atttgcacta
ggtgttcaat 
gccatgcctg 
aagacgcgtg 
ggaatcgatt 
tcccacaacg 
attcgccaca
ccaattggcg 
ctttcgaaag 
gctgggatta 
cgttcaaaca 
attatcatat
acgttattta 
atagaaaaca
ttactagatc 



<210> 31 
<211> 17111 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pArActAf-bin



14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15676 



<400> 31 

gatctgcaag 

agaactagtg 

tataccaagt 

tcagcttggc 

agaagggaga 

cctatcgttt 

atattagaaa 

acatttataa 

atatattagg 

cgaaaaatat 

tctttggagc 

accccttcta 

cttgttttga 

atagtgaata 

agacactttc 

attacgttga 

cctggattga 

atcgagcagg 

tgattaatta 

cgttatcttt 

aggccgaaaa 

ccgctttgaa 

cttgacagag 

tcattctccg

ataggaactt 

tgagatctgg 

ctagaatcga 

gaccttgatg 

aaatctgaac 

acactgttta 

tgtcgtacgt 

actctttttg 

gaaatcaaaa 

gatcagcgtt 

tttaacgatc 



cgatcgcggc 
gatcctgcaa 
caagtgtgct 
atggagctat 
ggactaatta 
taaaattttt 
ttaaagatat 
atatatacca 
aagtcagctt 
attagaaggg 
tggcctatcg 
aatatattag 
cgagttcgga 
tgatgaaaca 
tttcacggtc 
attgtatgaa 
ctcggtttaa 
tcacagtcat 
gtttaaaaat 
acctgtggtc 
taaagttgta 
ttgtctcgtt 
aagaacaagg 
ttttgaatct 
tctggatcta 
aattcgttta 
tctaagttga 
gagagatcca 
tgttgaagtt 
aggttagatg 
tgaacagaaa 
tgtgtttgca 
aactcgacgg 
ggtgggaaag 
agttcgccga 



gcgccattta 
gagttttaca 
ttttaaaaat
ataatcttaa 
atatttacca 
tttcatggac 
cgagatcctg 
agtcaagtgt 
ggcatggagc 
agaggactaa 
ttttaaaatt 
aaattaaaga 
tgtagtagta 
ttgtatctta 
tgaattaatt 
atctaattga 
gttaaccact 
gaagccatca 
tagttaacac 
gaaatgattc 
agagataaac 
gtcctcctca 
aagaagacta 
tcctcaatct 
ctttatttgc 
atttgggatc 
ccgatcagtt 
tgttcatgtt 
agattgaatc 
aagtttgtgt 
gctatttctg 
gctgtagacc 
cctgtgggca 
cgcgttacaa 
tgcagatatt 



aatgcccggg 
agtgaaatta 
tttgctgact
aaaggaatca 
gctccaggga 
gtcatatata 
caagagtttt 
gctttttaaa 
tatataatct 
ttaatattta 
ttttttcatg 
tatcgagatc 
gccattattt 
ttgtataaat 
atgatacaat 
acaagccaac 
aaaaaaacgg 
aagcaaaaga 
gagggaaaag 
gtgtctgtcg 
ccgcctatat 
ctttcatcag 
agagagaaag 
catcttcttc
tggatctcga 
tgtgaacctc 
agctcgatta 
acctgggaaa 
tgaacactgt 
atagattctt
attcaatcag 
atggtccgtc 
ttcagtctgg 
gaaagccggg 
cgtaattatg 



cgtttaaacg 
agtttaaaca 
tcataaaata
aaaatatcga 
aatatattct 
aatctaaacc 
acaagtgaaa 
aattttgctg 
taaaaaggaa 
ccagctccag 
gacgtcatat 
ccccgggctg 
aatgtacata 
atccataaac 
tctaatagaa 
cacgacgacg 
agctgtcatg 
actaatccaa 
gctgcctgac 
attttaatta 
aaattcatat 
ccgttttgaa 
taagagataa 
cgctctttct 
tcttgttttc 
cactaaatct 
tagctaccag 
tgatttgtat 
caatgttaga 
cgaaacttta 
ggtttatttg 
ctgtagaaac 
atcgcgaaaa 
caattgctgt
cgggcaacgt 



cggccgctct 
tttataaata 
tattaggaag 
aaaatatatt 
ttggagctgg 
ccttctaaat 
ttaagtttaa 
acttcataaa 
tcaaaaatat 
ggaaatatat 
ataaatctaa 
cagaattcgg 
ctaatcgtga 
acatcatgaa 
aacgaattaa 
actaacgttg 
taacacgcgg 
ggggctgaga 
agccaggtca 
tttttttgaa 
attttcctct 
tctccggcga 
tccaggagat
ttccaaggta 
tcaatttcct 
tttggtttta 
aatttggctt 
atgtgaaatg 
ttgaatctga 
ggatttgtag 
actgtattga 
cccaacccgt 
ctgtggaatt 
gccaggcagt 
ctggtatcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
cgcgaagtct

gtcactcatt 

tatacgccat 

accgtttgtg 

gacgaaaacg 

catcgcagcg 

acgcatgtcg 

gatgtcagcg 

agcgggactt 

gaactgtgcg 

atccggtcag

actggctttg 

atggtgcacg 

tacccttacg 

gaaactgctg 

ccgaaagaac 

gcgattaaag 

gccaacgaac 

acgcgtaaac 

cacaccgata 

tatgtccaaa 

tggcaggaga 

gggctgcact 

atgtatcacc 

gccgattttg 

actcgcgacc 

aacttcggtg

ccatcgtcgg

acatttggca 

tataatttct 

ttatgagatg 

acaaaatata 

atcgatcggg 

gtttagattt 

gaatatattt 

cgatattttt 

attttatgaa 

gtttaaactt

atatttagaa 

ggccagctcc 

ctaatatatt 

gacttcctaa 

tatatttata 

ctagagcggc 

atggggggca 

cctcaattgc 

tgactggcag 

ttgtttgatt 

ttagccctga 

ttgtacaacg 

aaaaggaaaa 

gacggctaaa 

ctttattacg 

agaaaaacta 

tgtaggaaaa 

ttcaaattat 

caaaacaaat 

gaaagataga 

tatatctttc 

aaatcactta 

agaaatagaa 



ttataccgaa
acggcaaagt 
ttgaagccga 
tgaacaacga 
gcaagaaaaa 
taatgctcta 
cgcaagactg 
ttgaactgcg 
tgcaagtggt 
tcacagccaa 
r ggcagtgaa 
gtcgtcatga
accacgcatt 
ctgaagagat 
ctgtcggctt 
tgtacagcga 
agctgatagc 
cggatacccg 
tcgacccgac 
ccatcagcga 
gcggcgattt 
aactgcatca
caatgtacac 
gcgtctttga 
cgacctcgca 
gcaaaccgaa 
aaaaaccgca 
ctacagcctc 
ataaagtttc 
gttgaattac 
ggtttttatg 
gcgcgcaaac 
aattgtttgg 
atatatgacg 
ccctggagct 
gattcctttt 
gtcagcaaaa
aatttcactt 
ggggtttaga 
aaagaatata 
tttcgatatt 
tatattttat 
aatgtttaaa 
cgcttaatta 
tcgcaccggt 
gagctttcta 
gatatatacc 
gtttctgttg 
tgaactgccg 
agacgacaga 
ttattgtgcc 
tgggaaacgt 
aatatcaaaa 
aatttttctg 
tgtcctaagt 
cctgaatgta 
gtaaaagaat 
aagtcctaca 
tcaaaataaa 
tcacaaatca 
agaagtgagt 



aggttgggca 
gtgggtcaar 
tgtcacgccg 
actgaactgg 
gcagtcttac
caccacgccg 
taaccacgcg 
tgatgcggat 
gaatccgcac 
aagccagaca 
gggcgaacag 
agatgcggac 
aatggactgg 
gctcgactgg 
taacctctct
agaggcagtc 
gcgtgacaaa 
tccgcaagtg 
gcgtccgatc 
tctctttgat 
ggaaacggca 
gccgattatc
cgacatgtgg 
tcgcgtcagc 
aggcatattg 
gtcggcggct 
gcagggaggc 
ggtggggaat 
ttaagattga 
gttaagcatg 
attagagtcc 
taggataaat
gggatctcga
tccatgaaaa 
ggtaaatatt 
taagattata 
tttttaaaaa 
gtaaaactct 
tttatatatg 
tttccctgga 
tttgattcct 
gaagtcagca 
cttaatttca 
aggccggcct 
gagtaatatt 
atttcaaact 
gttgtaattt 
gagtgcagcc
aggggaagcc 
acacccacgg 
aggcagttga 
atttaaaaaa 
aattcattgt 
attaccagga
gtggcaacaa 
agtttacttt 
tactagaggg 
atgccgttgt 
cataaaagcc 
caagtgattt 
gattgtggga 



ggccagcgta
aatcaggaag
tatgttattg 
cagactatcc 
ttccatgatt 
aacacctggg 
tctgttgact 
caacaggtgg 
ctctggcaac 
gagtgtgata 
ttcctgatta 
ttacgtggca 
attggggcca 
gcagatgaac 
ttaggcattg 
aacggggaaa 
aaccacccaa 
cacgggaata 
acctgcgtca 
gtgctgtgcc 
gagaaggtac 
atcaccgaat 
agtgaagagt 
gccgtcgtcg 
cgcgttggcg 
tttctgctgc 
aaacaatgaa 
tggagagctc
atcctgttgc 
taataattaa 
cgcaattata 
tatcgcgcgc 
tatctttaat
aaaattttaa 
aattagtcct 
tagctccatg 
gcacacttga 
tgcaggatct
acgtccatga
gctggtaaat 
ttttaagatt 
aaatttttaa 
cttgtaaaac 
gcaggcatgc 
gtacggctaa 
attcgggcct 
gagctcgtgt 
catttcaccg 
atcttgagcg 
gaccgagctt 
aagtcagcac 
aatcggtaaa 
tcatttactg 
acagaaagaa 
tattgtatta 
agctgaacat 
aaaagaaacc 
aaaaatcgga 
ctttaaagag 
gtgattgttg 
aatttaggcg 



tcgtgctgcg 
tgatggagca 
ccgggaaaag 
cgccgggaat 
tctttaacta
tggacgatat 
ggcaggtggt 
ttgcaactgg 
cgggtgaagg 
tctacccgct 
accacaaacc 
aaggattcga 
actcctaccg 
atggcatcgt 
gtttcgaagc 
ctcagcaagc 
gcgtggtgat 
tttcgccact 
atgtaatgtt 
tgaaccgtta 
tggaaaaaga 
acggcgtgga 
atcagtgtgc 
gtgaacaggt 
gtaacaagaa 
aaaaacgctg 
tcaacaactc 
gaatttcccc 
cggtcttgcg 
catgtaatgc 
catttaatac 
ggtgtcatct
ttctaatata 
aacgataggc 
ctcccttcta 
ccaagctgac 
cttggtatat 
cgatatcttt 
aaaaaaattt 
attaattagt 
atatagctcc 
aaagcacact 
tcttgcagga 
aagctcggat 
gagcgaattt 
aacttttggt 
gaataagtcg 
gacaagtcgg 
cggaatggga 
cgcaagcttg 
cttttaacga 
agagaaggca 
gaagctgtac 
aaagaagcag 
aaaaaatcgt
tttagaaaga 
ctggtaaaag 
gaaaagggat 
ggcttttata 
atgataaaat 
cacaaaaaga 



tttcgatgcg
tcagggcggc
tgtacgtatc 
ggtgattacc 
tgccggaatc 
caccgtggtg 
ggccaatggt 
acaaggcact 
ttatctctat 
tcgcgtcggc 
gttctacttt 
taacgtgctg 
tacctcgcat 
ggtgattgat 
gggcaacaag 
gcacttacag 
gtggagtatt
ggcggaagca 
ctgcgacgct 
ttacggatgg 
acttctggcc 
tacgttagcc 
atggctggat 
atggaatttc 
agggatcttc 
gactggcatg 
tcctggcgca
gatcgttcaa
atgattatca 
atgacgttat 
gcgatagaaa 
atgttactag 
tttagaaggg 
cagctccaaa 
atatattttt 
ttcctaatat 
atttataaat 
aatttctaat 
taaaacgata 
cctctccctt
atgccaagct 
tgacttggta 
tccactagtt 
ctcacctacg 
ggcctgtaga 
gtgatgatgc 
ctgtgtatgt 
ctagattgat 
atggatttcg 
ttgtaactga 
gtgctgaaat 
atcaagagaa 
ctaacgatat 
aaaaaagtat 
tttatggttg 
aaaaactcac 
gaatcaaaac 
atattgattt 
tattaatcac 
aagaataaga 
aaaacgaaat 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
gatacaccaa

gacttgcacc 

atggaaataa 

ttgtataata 

ttacatgaac 

ccaaataata

taaagggcat 

agacagtcat 

aattcaccaa 

gagtattcct

gtctgacatc

aacactaggg 

ggaatgcttt 

cacagatgtt 

tcgagaatat

agtaaacaat 

attatttaac

ttactaaagg 

agaggtccct 

gctcgggggc 

cggaaccaaa 

acattggttc 

agaagtcctc 

cacttgcggg 

acgccgacag 

ccgcaaatag 

atgaacacga 

ggtagaccga 

agatgcgcat 

ccgccggcaa 

cgaaaacgcc 

ccgacagccc 

gttcagcgaa 

ctggagcttt 

caagccgtcg 

gttcgtagag 

cccgcttgtt 

cccacaaggc 

aggccgctgc 

cgggtggaat 

cggtgcgagc 

tcccatatga 

aggacctggc 

gacaccgatt 

cgcgacaggc 

tggcaaagct 

tccaacacct 

gtgatcttca 

attttcttgt 

atcgtgtccg 

ttcgtgtgtt 

ccggcggttt 

gccaaacctg 

ggcagggcag 

accatcgagc 

atggtttcgg 

cggtcaaacg 

atgtgccctt 

tcggcaatga 

atcttgccct 

tgagagccaa 



tcagtgcaaa
atatcataaa 
gacttagaag 
ggaattgaag 
aaaaatataa
aaacaattga 
ttaacgacga
ctattcaact 
gatattctac 
taccatttaa 
tatctgattg
ttgctcttgc
catcctaaac 
ccagataaat 
cgtcaactgt
ttaagtaccg 
gggaggaaat 
gaatgtagat 
agcgcctacg 
tgagaaagcc 
ggaagtaggt 
ctgtaggcat 
cggccgccag 
tcagcacggt 
gatctagcgc 
cccccaggac 
ccatcagcgg 
aataaacaac 
ccaccagatt 
cgcccgcagc 
ggacagatgc 
aatgatctcg 
cgcctccatg
cttcagggcc 
aatctgagcc 
cgcgccgtgc 
cctgaaatgc 
cctagcgttt
ctcgcaactc
ccgatccgca 
tgaaatagtc 
atttcgtgta 
aacgggacgt 
ccaggtgccc 
attcctcggc 
cgtagaacgt 
gctgccacac 
cgtccttgtt 
tgcgcgtggt 
gccacggcgc 
tcagcaacgc 
ttcgcttctt 
ccgcctcctg 
ggggagccag 
cgacggactg 
catcctcggc 
tccgattcat 
attcctgatt 
agtcggtccc 
gcacgaatac 
aacacttgat 



aaaagatata 
aatcgaaaca 
caaacttaag 
ttaaattaga 
aatattctca 
atttaaaaga 
aactggctaa
tatcgtcaga 
agtttcaatt 
gcacacaaat 
ttgaagaagg 
acactcaagt
caaaagtaaa 
attggaagct 
ttactaaaaa 
ttacttatga 
aattctatga
aaattattag 
gggaatttgt 
cagtaaggaa 
taaacccgct 
cgggattggc 
ttgccaggcc 
tccgaacgcc 
tgcgtttggt 
cgccatcaat 
ctgcacagcg 
aagctccaga 
cccgttggaa 
agcataccgg 
gccttgtgag 
ccgtcgatgt 
ggctttttct 
gacaatcgga 
ttaatcacaa 
gcccgagcga 
cagtaaagcg 
gcaatgcacc 
ttcgcaggct 
catgaggcgg 
gaacatccgt 
gtggtcgcca 
tttcttgcca 
aacgcggtcg 
cttcgtgtaa 
gaaggtgatc 
cagttcgtca 
gacgtggaaa 
gaacagggca 
aatatcgaac 
ggcctgcttg 
ggtcgtcata 
ttcgagacga 
ttgcacgctg 
gaaggtttcg
ggaaaacccc 
tcaccctcct 
tgacccgcct 
gtagaccgtc 
cagcgacccc 
gcggaagaag 



atgggagata 
gcaaagaatg
agtgtgttga 
tgctaaaaat 
aaacttttta
aaccgatacc
aataagtaaa 
aaaattaaaa 
ccctaacaaa 
tattaaaaaa 
attctacaag
ctcgattcag 
cagtgtctta 
atatacgtac 
tcagtttcat
gcaagtattg 
gtcgcttttg 
gtatactact 
atcgataagg 
acaactgtag 
ccgatcaggc 
ggatcaaaca 
gtaaaggtga 
atggaaaccg 
gtcaacacca 
cgtatcgggc 
cctaccgtcg 
atagcgaaat 
tctgtcggac 
cgacccctcg 
cgtccttggg 
aggcgccgaa 
cctcgtgctc
tctcgcggaa
ttgtcaattt
tactgagcga 
ctggctgctg 
aggtcatcat 
tcgccgacct
aaggtttcca 
cgggccgtcg 
gcaaacagca 
cggtccagga 
gacgtgaagc 
taccggccat 
ggctcgccga 
tcgtcggccc 
atgaccttgt 
gagcgggccg 
aaggaaagct 
gcctcgctga 
gttcctcgcg 
cgcgaacgct 
tcgcgctcga 
cggggcgcac 
gcgtcgatca
tgcgggattg 
ggtgccttgg 
tggccgtcct 
ttgcccaaat 
tcggtgcgct 



agacggttcg 
gcggaaacgt 
tagtgcagta 
ttgtaattaa 
acgagtgaaa
gtttacgaaa
caggtaacgt 
ctgaatactc 
cagaggtata 
gtggtttttg 
cgtaccttgg 
caattgctta 
ataaaactta 
tttgtttcaa 
caagcaatga 
tctattttta 
taaatttgga 
gacagcttcc 
ggtacaaatt 
gttcgagtcg 
cgagccacgc 
gtaaagctac
gcagaggcac 
cccccgccag 
acagcgccac 
tacctagcag 
ccgcgacccg 
attaagtgcg 
gatcatcacg 
gcctcgctgt 
gccgtcctcc 
tgccacggca 
gtaaacggac 
atcctgcacg 
taatcctctg 
agcaagtgcg 
aacccccagc 
tgacccaggc 
gctcgcgcca 
gcttgagcgg 
gcgacagctt 
cgacgatttc 
cgcggaagcg 
ccatcgccgt
tgatcgacca 
taggggtgcg 
gcagctcgac 
tttgcagcgc 
tgtcgtttgg 
gcatttcctt
cctgttttgc 
tgtcgatggt 
ccacggcggc 
tcttggccgt 
gcatgacggt 
gttcttgcct 
ccccgactca 
tgtccagata 
tctcgtactt
acttgccgtg 
cctgcttgtc 



tgttcgtgct 
aaaagaagtt
tcttaaaatt 
gaaggagtga 
aagtactcaa 
ttggaacagg 
ctattgaatt 
gtgtcacttt 
aaattgttgg 
aaagccatgc 
atattcaccg 
agctgccagc 
cccgccatac 
aatgggtcaa 
aacacgccaa 
atagttatct 
aagttacacg 
aaggagctaa 
cccactaagc 
cgagatcccc 
caggccgaga 
tggaacgagc
gggaggttgc 
gcccgctgcg 
gcccgcagtt 
agcggcagag 
cccggcaggc 
ccgaggatga 
agcaataaac 
tcgggctcca 
tgtttgaaga 
tctcgcaacc
ccgaacatct 
tcggccgctc 
tttatcggca 
tcgagcagtg 
cggaactgac 
gtgttccacc 
cttcttcacg 
gtacggctcc 
gcggtacttc 
ctcgtcgatc
gtgcagcagc 
cgcctgtagg 
gcccaggtcc 
cttcgcgtac 
gccggtgtag 
ctcgcgcggg 
catcgctcgc 
gatctgctgc
caggtcctcg
catcgacttc 
cgatggcgcg 
agcttgctgg 
gcggcttgcg 
gtatgccttc
cgccggggca 
atccacctta 
ggtattccga 
ggcctcggcc 
gccggcatcg 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
ttgcgccact

tgacgtacct 

gaaattccct 

ctcttcttga 

tagcggctaa

aagctcgtcc 

ccgcgccgtg 

gtcgccattg 

gtagccctgg 

ttcctcaatc 

cctggttggc 

cggccagcct 

gaagaaggaa 

ttggatacac 

aaaggatgga 

aaagatccgt

ggccgtctat

cggccggccg 

gggcggacgt 

aggctcgatt 

gattttacgc 

attgacactt 

ggcagagtga 

gaaaatccag 

aacctgcttt 

gtgaccgcgc 

cgggcctccc

tggcaggcca 

aaatactttg 

tgcttctctc 

ttgccccgtc 

tatctgccta 

agcggcagcg 

tggatcagca 

ttttcaattc 

aaaacgacaa 

ttcggaaatc 

ctgcagtgag 

ccatcatcca 

cagttggtga 

gtgatctgat 

caaaatctct

tctgcttaca 

tgctcgaggc 

cgcgataatg 

ccagagttgt 

gtcagactaa 

actcctgatg 

ttagaagaat 

cggttgcatt 

gctcaggcgc 

cgtaatggct 

ccggattcag 

aaattaatag 

gccagcctat 

aaatatggta 

tttttctaat 

cgggacggcg 

atcttcccga 

ccacctacaa 

cgattcaggc 



cttcattaac 
cggtgtcacg 
cggtcttgcc 
tggagcgcat 
aaaagtcatg 
tgcttctctt 
cgcgggtcgt 
atgcgggcca 
ccgacggcca 
gctcttcgtt 
ttggtttcat 
cgcagagcag 
cacccgctcg 
caaggaaagt 
tataccgaaa 
cgaccctttc 
ggccctgcaa 
ccggcgttgt 
tgacacttga 
tcggccggcg 
gagtttccca 
gaggggcgcg 
tgacagatga
catttgcaag 
taaaccaata 
acgccgaagg 
atccccccag 
agcttgcttg 
cgatcgtgtt 
gcaacgccat 
gctcaccgtg 
accggctcag 
gcggcaggat 
atgagtatga 
aaaaatgtag 
attacgatcc 
tttatttcga 
gtctgcctcg 
gccagaaagt 
ttttgaactt 
ccttcaactc 
gatgttacat 
taaacagtaa 
cgcgattaaa 
tcgggcaatc 
ttctgaaaca 
actggctgac 
atgcatggtt 
atcctgattc
cgattcctgt 
aatcacgaat 
ggcctgttga 
tcgtcactca 
gttgtattga 
ggaactgcct 
ttgataatcc 
cagaattggt 
gctttgttga 
caacgcagac 
caaagctctc 
ctggtatgag 



cgct aratcg 
ggtaagatta 
ttgc wcgtcg 
ggggacgtgc 
get czgccct 
cgatcttcgc 
cggtgagcca 
gctcgcggac 
gcaggtaggc 
cgtctggaag
cagccatccg
gattcccgtt 
cgggcgggcc
ctacacgaac
aaatcgctat 
cgacgctcac 
acgcgccaga 
ggataccacg 
ggggccgact 
acgtggagct 
cagar gatgt 
actactgaca
ggggcgcacc 
ggttrccgcc 
tttaraaacc 
ggggtgcccc 
gggc" gcgcc 
gtcgTitccgc 
gcgcgcctgc 
ccgacggatg 
ttggggggaa 
ttctgcgtag 
atatrcaatt 
tggtcaatat 
atgtccgcag 
gtcgtattta 
cgtgtctaca
tgaagaaggt 
gagggagcca 
ttgcTittgcc 
agcaaaagtt 
tgcacaagat 
tacaaggggt 
ttccaacatg 
aggtgcgaca 
tggcaaaggt 
ggaatttatg 
actcaccact 
aggr gaaaat 
ttgraattgt 
gaaraacggt 
acaagtctgg 
tggrgatttc 
tgt uggacga 
cggrgagtgt 
tgaratgaat 
taattggttg
ataaatcgaa 
cgtr ccgtgg 
atcaaccgtg 
tcaacaacac 



aaaattgctt 
ccgataaact 
gtgatgtact 
ttggcaatca 
cgggcggacc 
cagcagggcg 
gagtttcagc
gtgctcatag 
cgacaggctc 
gcagtacacc
cttgccctca 
gagcaccgcc 
tacttcacct 
cctttggcaa 
aatgaccccg 
cgggctggtt 
aacgccgtcg 
cggaaaactt 
cacccggcgc 
ggccagcctc 
ggacaagcct 
gatgaggggc 
tattgacatt 
cgtttttcgg 
ttgtttttaa 
cccttctcga 
cctcggccgc 
gtgaacgtcg
ccggtgcgtc 
atgtttaaaa 
ggtgcacatg 
aaaccaacat 
gtaaatggct 
ggagaaaaag 
cgttattata 
taggcgaaag 
ttcacgtcca
gttgctgact 
cggttgatga 
acggaacggt 
cgatttattc 
aaaaatatat 
gttatgagcc 
gatgctgatt
atctatcgat 
agcgttgcca 
cctcttccga 
gcgatccctg 
attgttgatg 
ccttttaaca 
ttggttgatg 
aaagaaatgc 
tcacttgata 
gtcggaatcg 
tctccttcat 
aaattgcagt 
taacactggc 
cttttgctga 
caaagcaaaa
gctccctcac 
cttcttcacg 



gcggcttgtt
ggaactgatt
tcaccagctc 
cgcgcacccc 
acgcccatca
aggatcgtgg
aggccgccca
tccacgacgc 
atgccggccg 
ttgataggtg 
tctgttacgc
aggtgcgaat 
atcctgcccg 
aatcctgtat 
aagcagggtt 
gccctcgccg
aagccgtgtg
ggccctcact 
ggcgttgaca 
gcaaatcggc
ggggataagt
gcgatccttg
tgaggggctg 
ccaccgctaa 
ccagggctgc 
accctcccgg 
gaacggcctc
gctcgattgt
ggctgatctc 
gtcccatgtg 
gctcagttct 
gcaagctcca
tcatgtccgg
aaagagtaat 
aaatgaaagt 
caataaacaa 
aatgggggag 
cataccaggc 
gagctttgtt 
ctgcgttgtc 
aacaaagcca 
catcatgaac 
atattcaacg 
tatatgggta 
tgtatgggaa 
atgatgttac 
ccatcaagca
ggaaaacagc 
cgctggcagt 
gcgatcgcgt 
cgagtgattt 
ataaactttt 
accttatttt 
cagaccgata 
tacagaaacg 
ttcatttgat 
agagcattac 
gttgaaggat 
gttcaaaatc
tttctggctg 
aggcagacct 



agaattgcca 
ar ggcnnct c 
cgcgaagtcg 
ccggccgttt 
tgaccttgcc
catcaccgaa
ggcggcccag 
ccgtgatttt 
ccgccgcctt 
ggctgccctt 
cggcggtagc 
aagggacagt 
gctgacgccg 
atcgtgcgaa 
atgcagcgga 
ctgggctggc 
cgagacaccg 
gacagatgag 
gatgaggggc 
gaaaacgcct 
gccctgcggt 
acacttgagg 
tccacaggca 
cctgtctttt 
gccctggcgc 
cccgctaacg 
accccaaaaa 
acctgcgttc 
acggatcgac 
gatcactccg 
caatggaaat 
ccgggtgcaa 
gaaatctaca
taccaatttt 
acattttgat 
attattctaa 
atccgtcgac
ctgaatcgcc 
gtaggtggac
gggaagatgc 
cgttgtgtct 
aataaaactg 
ggaaacgtct 
taaatgggct 
gcccgatgcg 
agatgagatg 
ttttatccgt 
attccaggta 
gttcctgcgc 
atttcgtctc
tgatgacgag 
gccattctca 
tgacgagggg 
ccaggatctt
gctttttcaa 
gctcgatgag
gctgacttga 
cagatcacgc 
accaactggt 
gatgatgggg 
cagcgcctgc 



9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 
aggtcgacgg atctggggga tctagcagat
cggtttcgtt gcgactaaca tgagttcttg 
ccaacccgag gatatagcaa agctcgttcg 
tgtccccaag aatgaggtgc tatgcatgaa 
agggttaatg tctatgtatc ttaaataatg 
ttcactgtgc gacgcaaaaa tattaaataa 
tatcatcaat attataataa aaatatccat 
taatctgatt tgaatattta ttaattgtaa 
tgcactgcaa atgaaaatta acacatacta 
aagcggagtg agggcctcat atccggtctc 
tccaccaatg ccctcgacat agatgccggg 
catggtctca gcgccggctt taagctcaat 
cccagtccgg tgtaagaacg ggtctgtcca 
cagcatgaga tccccgcgct ggaggatcat 
gcccaacctt tcatagaagg cggcggtgga 
cgcttggtcg gtcatttcga accccagagt 
tagaaggcga tgcgctgcga atcgggagcg 
gcccattcgc cgccaagctc ttcagcaata 
cggtccgcca cacccagccg gccacagtcg 
atgatattcg gcaagcaggc atcgccatgg 
cgcgccttga gcctggcgaa cagttcggct 
tcatcctgat cgacaagacc ggcttccatc 
gcttggtggt cgaatgggca ggtagccgga 
gccatgatgg atactttctc ggcaggagca 
acttcgccca atagcagcca gtcccttccc 
caaggaacgc ccgtcgtggc cagccacgat 
agggcaccgg acaggtcggt cttgacaaaa 
aacacggcgg catcagagca gccgattgtc 
tccacccaag cggccggaga acctgcgtgc 
ctcgatcccc gggctgattt tctcagtctc 
ttgatatcag ccacaagtgt gtgggaatct 
tattgaattc tcttgaaagg aagtccgcaa 
catcaaatga atagtggcta agccatgctt 
ctcctttgta attgagatta acgaaactct 
aatgaacagg agttagataa atactaaatt 
ctttgtcatt gctgtggtaa ttcttttctg 
cagattagcc ttttcaattt cagaaagaat 
gcagcaggtc tcatcaagac gatctacccg 
cccaagaagg ttaaagatgc agtcaaaaga 
aaagatatat ttctcaagat cagaagtact 
cacaaaccaa ggcaagtaat agagattgga 
aaggccatgg agtcaaagat tcaaatagag 
gaacagttca tacagagtct cttacgactc 
gtggagcacg acacgcttgt ctactccaaa 
agggcaattg agacttttca acaaagggta 
ccagctatct gtcactttat tgtgaagata 
catcattgcg ataaaggaaa ggccatcgtt 
gatggacccc cacccacgag gagcatcgtg 
aagcaagtgg attgatgtga tatctccact 
ccttcgcaag acccttcctc tatataagga 
actctagagg atccggatcc gtcgaccatg 
gtcccaattc ttgttgaatt agatggtgat 
gagggtgaag gtgatgctac atacggaaag 
aaactacctg ttccatggcc aacacttgtc 
tcccgttatc cggatcatat gaagcggcac 
tacgtgcagg agaggaccat ctctttcaag 
gtcaagtttg agggagacac cctcgtcaac 
gaggacggaa acatcctcgg ccacaagttg 
atcacggcag acaaacaaaa gaatggaatc 
gaagatggat ccgttcaact agcagaccat 
cctgtccttt taccagacaa ccattacctg 



ccgcgagggg atcgagcccg acatatgccc 13140 
gacaaatttg attggacctg atgagatgat 13200 
tgcagcaatg gaacggccaa accgtgcttt 13260 
ggaatctacc cgttgatgtc caacagtctc 13320 
ttgtcggtat tttgtaatct catatagatt 13380 
atattattat tatctacgtv ttgattgaga 13440 
taaacacgat ttgatacaaa tgacagtcaa 13500 
cgaattacat aaagatcgaa tagaaaatac 13560 
ataaatgcgt caaatatctt tgccaagatc 13620 
agttacaagc acggtatccc cgaagcgcgc 13680 
ctcgacgctg aggacattgc ctaccttgag 13740 
cccatcccaa tctgaatatc ctatcccgcg 13800 
tccacctctg ttggggtggg cgaagaactc 13860 
ccagccggcg tcccggaaaa cgattccgaa 13920 
atcgaaatct cgtgatggca ggttgggcgt 13980 
cccgctcaga agaactcgtc aagaaggcga 14040 
gcgataccgt aaagcacgag gaagcggtca 14100 
tcacgggtag ccaacgctat gtcctgatag 14160 
atgaatccag aaaagcggcc attttccacc 14220 
gtcacgacga gatcctcgcc gtcgggcatg 14280 
ggcgcgagcc cctgatgctc ttcgtccaga 14340 
cgagtacgtg ctcgctcgat gcgatgtttc 14400 
tcaagcgtat gcagccgccg cattgcatca 144 60 
aggtgagatg acaggagatc ctgccccggc 14 520 
gcttcagtga caacgtcgag cacagctgcg 14580 
agccgcgctg cctcgtcctg cagttcattc 14640 
agaaccgggc gcccctgcgc tgacagccgg 14700 
tgttgtgccc agtcatagcc gaatagcctc 147 60 
aatccatctt gttcaatcat gctaaaggat 14820 
cagagatgtg tttaaatagg cagtagcctt 14880 
tatcttcgga tttcaattag gaattaacct 14940 
agtggttgtc ggttccttta atgtgttcaa 15000 
gccatctgat gtttcttcca agtttcgaat 15060 
tgaaatgagt attatctgtc ctaatcagaa 15120 
tctttatagt atttattacc gccaatgtct 15180 
cagctttaaa gctcagatct tgcaggtccc 15240 
gctaacccac agatggttag agaggcttac 15300 
agcaataatc tccaggagat caaatacctt 15360 
ttcaggacta actgcatcaa gaacacagag 15420 
attccagtat ggacgattca aggcttgctt 15480 
gtctctaaaa aggtagttcc cactgaatca 15540 
gacctaacag aactcgccgt aaagactggc 15600 
aatgacaaga agaaaatctt cgtcaacatg 15660 
aatatcaaag atacagtctc agaagaccaa 15720 
atatccggaa acctcctcgg attccattgc 15780 
gtggaaaagg aaggtggctc ctacaaatgc 15840 
gaagatgcct ctgccgacag tggtcccaaa 15900 
gaaaaagaag acgttccaac cacgtcttca 15960 
gacgtaaggg atgacgcaca atcccactat 16020 
agttcatttc atttggagag aacacggggg 16080 
agcaaaggag aagaactttt cactggagtt 16140 
gttaatgggc acaaattttc tgtcagtgga 16200 
cttaccctta aatttatttg cactactgga 16260 
actactttct cttatggtgt tcaatgcttt 16320 
gacttcttca agagcgccat gcctgaggga 16380 
gacgacggga actacaagac gcgtgctgaa 16440 
aggatcgagc ttaagggaat cgatttcaag 16500 
gaatacaact acaactccca caacgtatac 16560 
aaagctaact tcaaaattcg ccacaacatt 16620 
tatcaacaaa atactccaat tggcgatggc 16680 
tcgacacaat ctgccctttc gaaagatccc 1674 0 
aacgaaaagc gtgaccacat ggtccttctt gagtttgtaa ctgctgctgg gattacacat 16800
ggcatggatg agctatacaa ataaagatcc tcgaatttcc ccgatcgttc aaacatttgg 16860
caataaagtt tcttaagatt gaatcctgtt gccggtcttg cgatgattat catataattt 16920
ctgttgaatt acgttaagca tgtaataatt aacatgtaat gcatgacgtt atttatgaga 16980
tgggttttta tgattagagt cccgcaatta tacatttaat acgcgataga aaacaaaata 17040
tagcgcgcaa actaggataa attatcgcgc gcggtgtcat ctatgttact agatcgatcg 17100
ggaattagat c 17111



<210> 32
<211> 17116
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: pAfActAf-bin

<400> 32

gatctgcaag cgatcgcggc gcgccattta aatgcccggg cgtttaaacg cggccgcatt 60
cccgggggat ctcgatatct ttaatttcta atatatttag aaggggttta gatttatata 120
tgacgtccat gaaaaaaaat tttaaaacga taggccagct ccaaagaata tatttccctg 180
gagctggtaa atattaatta gtcctctccc ttctaatata tttttcgata tttttgattc 240
ctttttaaga ttatatagct ccatgccaag ctgacttcct aatatatttt atgaagtcag 300
caaaattttt aaaaagcaca cttgacttgg tatatattta taaatgttta aacttaattt 360
cacttgtaaa actcttgcag gatctcgata tctttaattt ctaatatatt tagaaggggt 420
ttagatttat atatgacgtc catgaaaaaa aattttaaaa cgataggcca gctccaaaga 480
atatatttcc ctggagctgg taaatattaa ttagtcctct cccttctaat atatttttcg 540
atatttttga ttccttttta agattatata gctccatgcc aagctgactt cctaatatat 600
tttatgaagt cagcaaaatt tttaaaaagc acacttgact tggtatatat ttataaatgt 660
ttaaacttaa tttcacttgt aaaactcttg caggatccac tagttctagg ggctgcagaa 720
ttcggcttgt tttgacgagt tcggatgtag tagtagccat tatttaatgt acatactaat 780
cgtgaatagt gaatatgatg aaacattgta tcttattgta taaatatcca taaacacatc 840
atgaaagaca ctttctttca cggtctgaat taattatgat acaattctaa tagaaaacga 900
attaaattac gttgaattgt atgaaatcta attgaacaag ccaaccacga cgacgactaa 960
cgttgcctgg attgactcgg tttaagttaa ccactaaaaa aacggagctg tcatgtaaca 1020
cgcggatcga gcaggtcaca gtcatgaagc catcaaagca aaagaactaa tccaaggggc 1080
tgagatgatt aattagttta aaaattagtt aacacgaggg aaaaggctgc ctgacagcca 1140
ggtcacgtta tctttacctg tggtcgaaat gattcgtgtc tgtcgatttt aattattttt 1200
ttgaaaggcc gaaaataaag ttgtaagaga taaacccgcc tatataaatt catatatttt 1260
cctctccgct ttgaattgtc tcgttgtcct cctcactttc atcagccgtt ttgaatctcc 1320
ggcgacttga cagagaagaa caaggaagaa gactaagaga gaaagtaaga gataatccag 1380
gagattcatt ctccgttttg aatcttcctc aatctcatct tcttccgctc tttctttcca 1440
aggtaatagg aactttctgg atctacttta tttgctggat ctcgatcttg ttttctcaat 1500
ttccttgaga tctggaattc gtttaatttg ggatctgtga acctccacta aatcttttgg 1560
ttttactaga atcgatctaa gttgaccgat cagttagctc gattatagct accagaattt 1620
ggcttgacct tgatggagag atccatgttc atgttacctg ggaaatgatt tgtatatgtg 1680
aaatgaaatc tgaactgttg aagttagatt gaatctgaac actgtcaatg ttagattgaa 1740
tctgaacact gtttaaggtt agatgaagtt tgtgtataga ttcttcgaaa ctttaggatt 1800
tgtagtgtcg tacgttgaac agaaagctat ttctgattca atcagggttt atttgactgt 1860
attgaactct ttttgtgtgt ttgcagctgt agaccatggt ccgtcctgta gaaaccccaa 1920
cccgtgaaat caaaaaactc gacggcctgt gggcattcag tctggatcgc gaaaactgtg 1980
gaattgatca gcgttggtgg gaaagcgcgt tacaagaaag ccgggcaatt gctgtgccag 2040
gcagttttaa cgatcagttc gccgatgcag atattcgtaa ttatgcgggc aacgtctggt 2100
atcagcgcga agtctttata ccgaaaggtt gggcaggcca gcgtatcgtg ctgcgtttcg 2160
atgcggtcac tcattacggc aaagtgtggg tcaataatca ggaagtgatg gagcatcagg 2220
gcggctatac gccatttgaa gccgatgtca cgccgtatgt tattgccggg aaaagtgtac 2280
gtatcaccgt ttgtgtgaac aacgaactga actggcagac tatcccgccg ggaatggtga 2340
ttaccgacga aaacggcaag aaaaagcagt cttacttcca tgatttcttt aactatgccg 2400
gaatccatcg cagcgtaatg ctctacacca cgccgaacac ctgggtggac gatatcaccg 2460
tggtgacgca tgtcgcgcaa gactgtaacc acgcgtctgt tgactggcag gtggtggcca 2520
atggtgatgt cagcgttgaa ctgcgtgatg cggatcaaca ggtggttgca actggacaag 2580
gcactagcgg gactttgcaa gtggtgaatc cgcacctctg gcaaccgggt gaaggttatc 2640
tctatgaact gtgcgtcaca gccaaaagcc agacagagtg tgatatctac ccgcttcgcg 2700
tcggcatccg gtcagtggca gtgaagggcg aacagttcct gattaaccac aaaccgttct 2760
actttactgg ctttggtcgt catgaagatg cggacttacg tggcaaagga ttcgataacg 2820
tgctgatggt gcacgaccac gcattaatgg actggattgg ggccaactcc taccgtacct 2880
cgcattaccc ttacgctgaa gagatgctcg actgggcaga tgaacatggc atcgtggtga 2940
ttgatgaaac tgctgctgtc ggctttaacc tctctttagg cattggtttc gaagcgggca 3000
acaagccgaa agaactgtac agcgaagagg cagtcaacgg ggaaactcag caagcgcact 3060
tacaggcgat taaagagctg atagcgcgtg acaaaaacca cccaagcgtg gtgatgtgga 3120
gtattgccaa cgaaccggat acccgtccgc aagtgcacgg gaatatttcg ccactggcgg 3180
aagcaacgcg taaactcgac ccgacgcgtc cgatcacctg cgtcaatgta atgttctgcg 3240
acgctcacac cgataccatc agcgatctct ttgatgtgct gtgcctgaac cgttattacg 3300
gatggtatgt ccaaagcggc gatttggaaa cggcagagaa ggtactggaa aaagaacttc 3360
tggcctggca ggagaaactg catcagccga ttatcatcac cgaatacggc gtggatacgt 3420
tagccgggct gcactcaatg tacaccgaca tgtggagtga agagtatcag tgtgcatggc 3480
tggatatgta tcaccgcgtc tttgatcgcg tcagcgccgt cgtcggtgaa caggtatgga 3540
atttcgccga ttttgcgacc tcgcaaggca tattgcgcgt tggcggtaac aagaaaggga 3600
tcttcactcg cgaccgcaaa ccgaagtcgg cggcttttct gctgcaaaaa cgctggactg 3660
gcatgaactt cggtgaaaaa ccgcagcagg gaggcaaaca atgaatcaac aactctcctg 3720
gcgcaccatc gtcggctaca gcctcggtgg ggaattggag agctcgaatt tccccgatcg 3780
ttcaaacatt tggcaataaa gtttcttaag attgaatcct gttgccggtc ttgcgatgat 3840
tatcatataa tttctgttga attacgttaa gcatgtaata attaacatgt aatgcatgac 3900
gttatttatg agatgggttt ttatgattag agtcccgcaa ttatacattt aatacgcgat 3960
agaaaacaaa atatagcgcg caaactagga taaattatcg cgcgcggtgt catctatgtt 4020
actagatcga tcgggaattg tttgggggat ctcgatatct ttaatttcta atatatttag 4080
aaggggttta gatttatata tgacgtccat gaaaaaaaat tttaaaacga taggccagct 4140
ccaaagaata tatttccctg gagctggtaa atattaatta gtcctctccc ttctaatata 4200
tttttcgata tttttgattc ctttttaaga ttatatagct ccatgccaag ctgacttcct 4260
aatatatttt atgaagtcag caaaattttt aaaaagcaca cttgacttgg tatatattta 4320
taaatgttta aacttaattt cacttgtaaa actcttgcag gatctcgata tctttaattt 4380
ctaatatatt tagaaggggt ttagatttat atatgacgtc catgaaaaaa aattttaaaa 4440
cgataggcca gctccaaaga atatatttcc ctggagctgg taaatattaa ttagtcctct 4500
cccttctaat atatttttcg atatttttga ttccttttta agattatata gctccatgcc 4560
aagctgactt cctaatatat tttatgaagt cagcaaaatt tttaaaaagc acacttgact 4620
tggtatatat ttataaatgt ttaaacttaa tttcacttgt aaaactcttg caggatccac 4680
tagttctaga gcggccgctt aattaaggcc ggcctgcagg catgcaagct cggatctcac 4740
ctacgatggg gggcatcgca ccggtgagta atattgtacg gctaagagcg aatttggcct 4800
gtagacctca attgcgagct ttctaatttc aaactattcg ggcctaactt ttggtgtgat 4860
gatgctgact ggcaggatat ataccgttgt aatttgagct cgtgtgaata agtcgctgtg 4920
tatgtttgtt tgattgtttc tgttggagtg cagcccattt caccggacaa gtcggctaga 4980
ttgatttagc cctgatgaac tgccgagggg aagccatctt gagcgcggaa tgggaatgga 5040
tttcgttgta caacgagacg acagaacacc cacgggaccg agcttcgcaa gcttgttgta 5100
actgaaaaag gaaaattatt gtgccaggca gttgaaagtc agcacctttt aacgagtgct 5160
gaaatgacgg ctaaatggga aacgtattta aaaaaaatcg gtaaaagaga aggcaatcaa 5220
gagaacttta ttacgaatat caaaaaattc attgttcatt tactggaagc tgtacctaac 5280
gatatagaaa aactaaattt ttctgattac caggaacaga aagaaaaaga agcagaaaaa 5340
agtattgtag gaaaatgtcc taagtgtggc aacaatattg tattaaaaaa atcgttttat 5400
ggttgttcaa attatcctga atgtaagttt actttagctg aacattttag aaagaaaaaa 5460
ctcaccaaaa caaatgtaaa agaattacta gagggaaaag aaaccctggt aaaaggaatc 5520
aaaacgaaag atagaaagtc ctacaatgcc gttgtaaaaa tcggagaaaa gggatatatt 5580
gattttatat ctttctcaaa ataaacataa aagcccttta aagagggctt ttatatatta 5640
atcacaaatc acttatcaca aatcacaagt gatttgtgat tgttgatgat aaaataagaa 5700
taagaagaaa tagaaagaag tgagtgattg tgggaaattt aggcgcacaa aaagaaaaac 5760
gaaatgatac accaatcagt gcaaaaaaag atataatggg agataagacg gttcgtgttc 5820
gtgctgactt gcaccatatc ataaaaatcg aaacagcaaa gaatggcgga aacgtaaaag 5880
aagttatgga aataagactt agaagcaaac ttaagagtgt gttgatagtg cagtatctta 5940
aaattttgta taataggaat tgaagttaaa ttagatgcta aaaatttgta attaagaagg 6000
agtgattaca tgaacaaaaa tataaaatat tctcaaaact ttttaacgag tgaaaaagta 6060
ctcaaccaaa taataaaaca attgaattta aaagaaaccg ataccgttta cgaaattgga 6120
acaggtaaag ggcatttaac gacgaaactg gctaaaataa gtaaacaggt aacgtctatt 6180
gaattagaca gtcatctatt caacttatcg tcagaaaaat taaaactgaa tactcgtgtc 6240
actttaattc accaagatat tctacagttt caattcccta acaaacagag gtataaaatt 6300
gttgggagta ttccttacca tttaagcaca caaattatta aaaaagtggt ttttgaaagc 6360
catgcgtctg acatctatct gattgttgaa gaaggattct acaagcgtac cttggatatt 6420
caccgaacac tagggttgct cttgcacact caagtctcga ttcagcaatt gcttaagctg 6480
ccagcggaat gctttcatcc taaaccaaaa gtaaacagtg tcttaataaa acttacccgc 6540
cataccacag atgttccaga taaatattgg aagctatata cgtactttgt ttcaaaatgg 6600
gtcaatcgag aatatcgtca actgtttact aaaaatcagt ttcatcaagc aatgaaacac 6660
gccaaagtaa acaatttaag taccgttact tatgagcaag tattgtctat ttttaatagt 6720
tatctattat ttaacgggag gaaataattc tatgagtcgc ttttgtaaat ttggaaagtt 6780
acacgttact aaagggaatg tagataaatt attaggtata ctactgacag cttccaagga 6840
gctaaagagg tccctagcgc ctacggggaa tttgtatcga taaggggtac aaattcccac 6900
taagcgctcg ggggctgaga aagcccagta aggaaacaac tgtaggttcg agtcgcgaga 6960
tcccccggaa ccaaaggaag taggttaaac ccgctccgat caggccgagc cacgccaggc 7020
cgagaacatt ggttcctgta ggcatcggga ttggcggatc aaacactaaa gctactggaa 7080
cgagcagaag tcctccggcc gccagttgcc aggccgtaaa ggtgagcaga ggcacgggag 7140
gttgccactt gcgggtcagc acggttccga acgccatgga aaccgccccc gccaggcccg 7200
ctgcgacgcc gacaggatct agcgctgcgt ttggtgtcaa caccaacagc gccacgcccg 7260
cagttccgca aatagccccc aggaccgcca tcaatcgtat cgggctacct agcagagcgg 7320
cagagatgaa cacgaccatc ^gcggctgca cagcgcctac cgtcgccgcg acccgcccgg 7380
caggcggtag accgaaataa acaacaagct ccagaatagc gaaatattaa gtgcgccgag 7440
gatgaagatg cgcatccacc agattcccgt tggaatctgt cggacgatca tcacgagcaa 7500
taaacccgcc ggcaacgccc gcagcagcat accggcgacc cctcggcctc gctgttcggg 7560
ctccacgaaa acgccggaca gatgcgcctt gtgagcgtcc ttggggccgt cctcctgttt 7620
gaagaccgac agcccaatga tctcgccgtc gatgtaggcg ccgaatgcca cggcatctcg 7680
caaccgttca gcgaacgcct ccatgggctt tttctcctcg tgctcgtaaa cggacccgaa 7740
catctctgga gctttcttca gggccgacaa tcggatctcg cggaaatcct gcacgtcggc 7800
cgctccaagc cgtcgaatct gagccttaat cacaattgtc aattttaatc ctctgtttat 7860
cggcagttcg tagagcgcgc cgtgcgcccg agcgatactg agcgaagcaa gtgcgtcgag 7920
cagtgcccgc ttgttcctga aatgccagta aagcgctggc tgctgaaccc ccagccggaa 7980
ctgaccccac aaggccctag cgtttgcaat gcaccaggtc atcattgacc caggcgtgtt 8040
ccaccaggcc gctgcctcgc aactcttcgc aggcttcgcc gacctgctcg cgccacttct 8100
tcacgcgggt ggaatccgat ccgcacatga ggcggaaggt ttccagcttg agcgggtacg 8160
gctcccggtg cgagctgaaa tagtcgaaca tccgtcgggc cgtcggcgac agcttgcggt 8220
acttctccca tatgaatttc gtgtagtggt cgccagcaaa cagcacgacg atttcctcgt 8280
cgatcaggac ctggcaacgg gacgttttct tgccacggtc caggacgcgg aagcggtgca 8340
gcagcgacac cgattccagg tgcccaacgc ggtcggacgt gaagcccatc gccgtcgcct 8400
gtaggcgcga caggcattcc tcggccttcg tgtaataccg gccattgatc gaccagccca 8460
ggtcctggca aagctcgtag aacgtgaagg tgatcggctc gccgataggg gtgcgcttcg 8520
cgtactccaa cacctgctgc cacaccagtt cgtcatcgtc ggcccgcagc tcgacgccgg 8580
tgtaggtgat cttcacgtcc ttgttgacgt ggaaaatgac cttgttttgc agcgcctcgc 8640
gcgggatttt cttgttgcgc gtggtgaaca gggcagagcg ggccgtgtcg tttggcatcg 8700
ctcgcatcgt gtccggccac ggcgcaatat cgaacaagga aagctgcatt tccttgatct 8760
gctgcttcgt gtgtttcagc aacgcggcct gcttggcctc gctgacctgt tttgccaggt 8820
cctcgccggc ggtttttcgc ttcttggtcg tcatagttcc tcgcgtgtcg atggtcatcg 8880
acttcgccaa acctgccgcc tcctgttcga gacgacgcga acgctccacg gcggccgatg 8940
gcgcgggcag ggcaggggga gccagttgca cgctgtcgcg ctcgatcttg gccgtagctt 9000
gctggaccat cgagccgacg gactggaagg tttcgcgggg cgcacgcatg acggtgcggc 9060
ttgcgatggt ttcggcatcc tcggcggaaa accccgcgtc gatcagttct tgcctgtatg 9120
ccttccggtc aaacgtccga ttcattcacc ctccttgcgg gattgccccg actcacgccg 9180
gggcaatgtg cccttattcc tgatttgacc cgcctggtgc cttggtgtcc agataatcca 9240
ccttatcggc aatgaagtcg gtcccgtaga ccgtctggcc gtccttctcg tacttggtat 9300
tccgaatctt gccctgcacg aataccagcg accccttgcc caaatacttg ccgtgggcct 9360
cggcctgaga gccaaaacac ttgatgcgga agaagtcggt gcgctcctgc ttgtcgccgg 9420
catcgttgcg ccactcttca ttaaccgcta tatcgaaaat tgcttgcggc ttgttagaat 9480
tgccatgacg tacctcggtg tcacgggtaa gattaccgat aaactggaac tgattatggc 9540
nnctcgaaat tccctcggtc ttgccttgct cgtcggtgat gtacttcacc agctccgcga 9600
agtcgctctt cttgatggag cgcatgggga cgtgcttggc aatcacgcgc accccccggc 9660
cgttttagcg gctaaaaaag tcatggctct gccctcgggc ggaccacgcc catcatgacc 9720
ttgccaagct cgtcctgctt ctcttcgatc ttcgccagca gggcgaggat cgtggcatca 9780
ccgaaccgcg ccgtgcgcgg gtcgtcggtg agccagagtt tcagcaggcc gcccaggcgg 9840
cccaggtcgc cattgatgcg ggccagctcg cggacgtgct catagtccac gacgcccgtg 9900
attttgtagc cctggccgac ggccagcagg taggccgaca ggctcatgcc ggccgccgcc 9960
gccttttcct caatcgctct tcgttcgtct ggaaggcagt acaccttgat aggtgggctg 10020
cccttcctgg ttggcttggt ttcatcagcc atccgcttgc cctcatctgt tacgccggcg 10080
gtagccggcc agcctcgcag agcaggattc ccgttgagca ccgccaggtg cgaataaggg 10140
acagtgaaga aggaacaccc gctcgcgggt gggcctactt cacctatcct gcccggctga 10200
cgccgttgga tacaccaagg aaagtctaca cgaacccttt ggcaaaatcc tgtatatcgt 10260
gcgaaaaagg atggatatac cgaaaaaatc gctataatga ccccgaagca gggttatgca 10320
gcggaaaaga tccgtcgacc ctttccgacg ctcaccgggc tggttgccct cgccgctggg 10380
ctggcggccg tctatggccc tgcaaacgcg ccagaaacgc cgtcgaagcc gtgtgcgaga 10440
caccgcggcc ggccgccggc gttgtggata ccacgcggaa aacttggccc tcactgacag 10500
atgaggggcg gacgttgaca cttgaggggc cgactcaccc ggcgcggcgt tgacagatga 10560
ggggcaggct cgatttcggc cggcgacgtg gagctggcca gcctcgcaaa tcggcgaaaa 10620
cgcctgattt tacgcgagtt tcccacagat gatgtggaca agcctgggga taagtgccct 10680
gcggtattga cacttgaggg gcgcgactac tgacagatga ggggcgcgat ccttgacact 10740
tgaggggcag agtgatgaca gatgaggggc gcacctattg acatttgagg ggctgtccac 10800
aggcagaaaa tccagcattt gcaagggttt ccgcccgttt ttcggccacc gctaacctgt 10860
cttttaacct gcttttaaac caatatttat aaaccttgtt tttaaccagg gctgcgccct 10920
ggcgcgtgac cgcgcacgcc gaaggggggt gccccccctt ctcgaaccct cccggcccgc 10980
taacgcgggc ctcccatccc cccaggggct gcgcccctcg gccgcgaacg gcctcacccc 11040
aaaaatggca ggccaagctt gcttggtcgt tccgcgtgaa cgtcggctcg attgtacctg 11100
cgttcaaata ctttgcgatc gtgttgcgcg cctgcccggt gcgtcggctg atctcacgga 11160
tcgactgctt ctctcgcaac gccatccgac ggatgatgtt taaaagtccc atgtggatca 11220
ctccgttgcc ccgtcgctca ccgtgttggg gggaaggtgc acatggctca gttctcaatg 11280
gaaattatct gcctaaccgg ctcagttctg cgtagaaacc aacatgcaag ctccaccggg 11340
tgcaaagcgg cagcggcggc aggatatatt caattgtaaa tggcttcatg tccgggaaat 11400
ctacatggat cagcaatgag tatgatggtc aatatggaga aaaagaaaga gtaattacca 11460
attttttttc aattcaaaaa tgtagatgtc cgcagcgtta ttataaaatg aaagtacatt 11520
ttgataaaac gacaaattac gatccgtcgt atttataggc gaaagcaata aacaaattat 11580
tctaattcgg aaatctttat ttcgacgtgt ctacattcac gtccaaatgg gggagatccg 11640
tcgacctgca gtgaggtctg cctcgtgaag aaggtgttgc tgactcatac caggcctgaa 11700
tcgccccatc atccagccag aaagtgaggg agccacggtt gatgagagct ttgttgtagg 11760
tggaccagtt ggtgattttg aacttttgct ttgccacgga acggtctgcg ttgtcgggaa 11820
gatgcgtgat ctgatccttc aactcagcaa aagttcgatt tattcaacaa agccacgttg 11880
tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca tgaacaataa 11940
aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt caacgggaaa 12000
cgtcttgctc gaggccgcga ttaaattcca acatggatgc tgatttatat gggtataaat 12060
gggctcgcga taatgtcggg caatcaggtg cgacaatcta tcgattgtat gggaagcccg 12120
atgcgccaga gttgtttctg aaacatggca aaggtagcgt tgccaatgat gttacagatg 12180
agatggtcag actaaactgg ctgacggaat ttatgcctct tccgaccatc aagcatttta 12240
tccgtactcc tgatgatgca tggttactca ccactgcgat ccctgggaaa acagcattcc 12300
aggtattaga agaatatcct gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc 12360
tgcgccggtt gcattcgatt cctgtttgta attgtccttt taacagcgat cgcgtatttc 12420
gtctcgctca ggcgcaatca cgaatgaata acggtttggt tgatgcgagt gattttgatg 12480
acgagcgtaa tggctggcct gttgaacaag tctggaaaga aatgcataaa cttttgccat 12540
tctcaccgga ttcagtcgtc actcatggtg atttctcact tgataacctt atttttgacg 12600
aggggaaatt aataggttgt attgatgttg gacgagtcgg aatcgcagac cgataccagg 12660
atcttgccag cctatggaac tgcctcggtg agtgttctcc ttcattacag aaacggcttt 12720
ttcaaaaata tggtattgat aatcctgata tgaataaatt gcagtttcat ttgatgctcg 12780
atgagttttt ctaatcagaa ttggttaatt ggttgtaaca ctggcagagc attacgctga 12840
cttgacggga cggcggcttt gttgaataaa tcgaactttt gctgagttga aggatcagat 12900
cacgcatctt cccgacaacg cagaccgttc cgtggcaaag caaaagttca aaatcaccaa 12960
ctggtccacc tacaacaaag ctctcatcaa ccgtggctcc ctcactttct ggctggatga 13020
tggggcgatt caggcctggt atgagtcagc aacaccttct tcacgaggca gacctcagcg 13080
cctgcaggtc gacggatctg ggggatctag cagatccgcg aggggatcga gcccgacata 13140
tgccccggtt tcgttgcgac taacatgagt tcttggacaa atttgattgg acctgatgag 13200
atgatccaac ccgaggatat agcaaagctc gttcgtgcag caatggaacg gccaaaccgt 13260
gcttttgtcc ccaagaatga ggtgctatgc atgaaggaat ctacccgttg atgtccaaca 13320
gtctcagggt taatgtctat gtatcttaaa taatgttgtc ggtattttgt aatctcatat 13380
agattttcac tgtgcgacgc aaaaatatta aataaatatt attattatct acgtvttgat 13440
tgagatatca tcaatattat aataaaaata tccattaaac acgatttgat acaaatgaca 13500
gtcaataatc tgatttgaat atttattaat tgtaacgaat tacataaaga tcgaatagaa 13560
aatactgcac tgcaaatgaa aattaacaca tactaataaa tgcgtcaaat atctttgcca 13620
agatcaagcg gagtgagggc ctcatatccg gtctcagtta caagcacggt atccccgaag 13680
cgcgctccac caatgccctc gacatagatg ccgggctcga cgctgaggac attgcctacc 13740
ttgagcatgg tctcagcgcc ggctttaagc tcaatcccat cccaatctga atatcctatc 13800
ccgcgcccag tccggtgtaa gaacgggtct gtccatccac ctctgttggg gtgggcgaag 13860
aactccagca tgagatcccc gcgctggagg atcatccagc cggcgtcccg gaaaacgatt 13920
ccgaagccca acctttcata gaaggcggcg gtggaatcga aatctcgtga tggcaggttg 13980
ggcgtcgctt ggtcggtcat ttcgaacccc agagtcccgc tcagaagaac tcgtcaagaa 14040
ggcgatagaa ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc 14100
ggtcagccca ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct 14160
gatagcggtc cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt 14220
ccaccatgat attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg 14280
gcatgcgcgc cttgagcctg gcgaacagtt cggctggcgc gagcccctga tgctcttcgt 14340
ccagatcatc ctgatcgaca agaccggctt ccatccgagt acgtgctcgc tcgatgcgat 14400
gtttcgcttg gtggtcgaat gggcaggtag ccggatcaag cgtatgcagc cgccgcattg 14460
catcagccat gatggatact ttctcggcag gagcaaggtg agatgacagg agatcctgcc 14520
ccggcacttc gcccaatagc agccagtccc ttcccgcttc agtgacaacg tcgagcacag 14580
ctgcgcaagg aacgcccgtc gtggccagcc acgatagccg cgctgcctcg tcctgcagtt 14640
cattcagggc accggacagg tcggtcttga caaaaagaac cgggcgcccc tgcgctgaca 14700
gccggaacac ggcggcatca gagcagccga ttgtctgttg tgcccagtca tagccgaata 14760
gcctctccac ccaagcggcc ggagaacctg cgtgcaatcc atcttgttca atcatgctaa 14820
aggatctcga tccccgggct gattttctca gtctccagag atgtgtttaa ataggcagta 14880
gccttttgat atcagccaca agtgtgtggg aatcttatct tcggatttca attaggaatt 14940
aaccttattg aattctcttg aaaggaagtc cgcaaagtgg ttgtcggttc ctttaatgtg 15000
ttcaacatca aatgaatagt ggctaagcca tgcttgccat ctgatgtttc ttccaagttt 15060
cgaatctcct ttgtaattga gattaacgaa actcttgaaa tgagtattat ctgtcctaat 15120
cagaaaatga acaggagtta gataaatact aaatttcttt atagtattta ttaccgccaa 15180
tgtctctttg tcattgctgt ggtaattctt ttctgcagct ttaaagctca gatcttgcag 15240
gtccccagat tagccttttc aatttcagaa agaatgctaa cccacagatg gttagagagg 15300
cttacgcagc aggtctcatc aagacgatct acccgagcaa taatctccag gagatcaaat 15360
accttcccaa gaaggttaaa gatgcagtca aaagattcag gactaactgc atcaagaaca 15420
cagagaaaga tatatttctc aagatcagaa gtactattcc agtatggacg attcaaggct 15480
tgcttcacaa accaaggcaa gtaatagaga ttggagtctc taaaaaggta gttcccactg 15540
aatcaaaggc catggagtca aagattcaaa tagaggacct aacagaactc gccgtaaaga 15600
ctggcgaaca gttcatacag agtctcttac gactcaatga caagaagaaa atcttcgtca 15660
acatggtgga gcacgacacg cttgtctact ccaaaaatat caaagataca gtctcagaag 15720
accaaagggc aattgagact tttcaacaaa gggtaatatc cggaaacctc ctcggattcc 15780
attgcccagc tatctgtcac tttattgtga agatagtgga aaaggaaggt ggctcctaca 15840
aatgccatca ttgcgataaa ggaaaggcca tcgttgaaga tgcctctgcc gacagtggtc 15900
ccaaagatgg acccccaccc acgaggagca tcgtggaaaa agaagacgtt ccaaccacgt 15960
cttcaaagca agtggattga tgtgatatct ccactgacgt aagggatgac gcacaatccc 16020
actatccttc gcaagaccct tcctctatat aaggaagttc atttcatttg gagagaacac 16080
gggggactct agaggatccg gatccgtcga ccatgagcaa aggagaagaa cttttcactg 16140
gagttgtccc aattcttgtt gaattagatg gtgatgttaa tgggcacaaa ttttctgtca 16200
gtggagaggg tgaaggtgat gctacatacg gaaagcttac ccttaaattt atttgcacta 16260
ctggaaaact acctgttcca tggccaacac ttgtcactac tttctcttat ggtgttcaat 16320
gcttttcccg ttatccggat catatgaagc ggcacgactt cttcaagagc gccatgcctg 16380
agggatacgt gcaggagagg accatctctt tcaaggacga cgggaactac aagacgcgtg 16440
ctgaagtcaa gtttgaggga gacaccctcg tcaacaggat cgagcttaag ggaatcgatt 16500
tcaaggagga cggaaacatc ctcggccaca agttggaata caactacaac tcccacaacg 16560
tatacatcac ggcagacaaa caaaagaatg gaatcaaagc taacttcaaa attcgccaca 16620
acattgaaga tggatccgtt caactagcag accattatca acaaaatact ccaattggcg 16680
atggccctgt ccttttacca gacaaccatt acctgtcgac acaatctgcc ctttcgaaag 16740
atcccaacga aaagcgtgac cacatggtcc ttcttgagtt tgtaactgct gctgggatta 16800
cacatggcat ggatgagcta tacaaataaa gatcctcgaa tttccccgat cgttcaaaca 16860
tttggcaata aagtttctta agattgaatc ctgttgccgg tcttgcgatg attatcatat 16920
aatttctgtt gaattacgtt aagcatgtaa taattaacat gtaatgcatg acgttattta 16980
tgagatgggt ttttatgatt agagtcccgc aattatacat ttaatacgcg atagaaaaca 17040
aaatatagcg cgcaaactag gataaattat cgcgcgcggt gtcatctatg ttactagatc 17100
gatcgggaat tagatc 17116

<210> 33
<211> 646
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: 3' MAR in pAf-Act2-Af

<400> 33

gggggatctc gatatcttta atttctaata tatttagaag gggtttagat ttatatatga 60
cgtccatgaa aaaaaatttt aaaacgatag gccagctcca aagaatatat ttccctggag 120
ctggtaaata ttaattagtc ctctcccttc taatatattt ttcgatattt ttgattcctt 180
tttaagatta tatagctcca tgccaagctg acttcctaat atattttatg aagtcagcaa 240
aatttttaaa aagcacactt gacttggtat atatttataa atgtttaaac ttaatttcac 300
ttgtaaaact cttgcaggat ctcgatatct ttaatttcta atatatttag aaggggttta 360
gatttatata tgacgtccat gaaaaaaaat tttaaaacga taggccagct ccaaagaata 420
tatttccctg gagctggtaa atattaatta gtcctctccc ttctaatata tttttcgata 480
tttttgattc ctttttaaga ttatatagct ccatgccaag ctgacttcct aatatatttt 540
atgaagtcag caaaattttt aaaaagcaca cttgacttgg tatatattta taaatgttta 600
aacttaattt cacttgtaaa actcttgcag gatccactag ttctag 646
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